AD-A239 858

REPORT DOCUMENTATION PAGE

REPORT DATE: December 1990
REPORT TYPE AND DATES COVERED: Dissertation
TITLE AND SUBTITLE: A Mathematical Model for a Paired Comparison
Experiment on a Continuum of Response
AUTHOR(S): Kaye A. de Ruiz
PERFORMING ORGANIZATION NAME(S) AND ADDRESS(ES): AFIT Student
Attending: University of California-Riverside
SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES): AFIT/CI,
Wright-Patterson AFB OH 45433-6583
DISTRIBUTION/AVAILABILITY STATEMENT: Approved for Public Release
IAW 190-1. Distributed Unlimited. ERNEST A. HAYGOOD, 1st Lt, USAF,
Executive Officer

91-09303


UNIVERSITY  OF  CALIFORNIA 
RIVERSIDE 


A  Mathematical  Model  for  a  Paired  Comparison 
Experiment  on  a  Continuum  of  Response 


A  Dissertation  submitted  in  partial  satisfaction 
of  the  requirements  for  the  degree  of 

Doctor  of  Philosophy 

in

Applied  Statistics 

by 

Kaye  A.  de  Ruiz 
December  1990 


Dissertation  Committee: 

Professor  Robert  J.  Beaver,  Chairperson 
Professor Barry C. Arnold
Professor  Richard  K.  Eyman 


ACKNOWLEDGEMENTS 


While earning my doctoral degree, I have received
much support from all of the students and staff in
the statistics department. I would like to give
special thanks to my major professor, Dr. Robert
J. Beaver, for his unceasing patience and constant
help and friendship throughout the research and
completion of this dissertation. My thanks also
go to Dr. Barry C. Arnold and Dr. Richard K. Eyman
for their assistance with the original problem
development and with the professional completion
of this paper. In addition, I wish to thank
Peggy Franklin for her help in long-distance
coordination with various university departments
and Dale Anderson and Jim Bentley for their
assistance in the computer programming aspects of
the research. Finally, I would like to thank my
parents, Glenn Abbott and Mary Lea Abbott, for
their support throughout all my college years.


This dissertation is dedicated to my spouse,

Samuel

and to our four sons,

Francisco Xavier
Casildo Glenn Katec
Esteban Mino Zaragoza
and
Quitos Maximilliano

for their infinite love and patient understanding.


ABSTRACT OF THE DISSERTATION


A Mathematical Model for a Paired Comparison
Experiment on a Continuum of Response


by

Kaye A. de Ruiz

Doctor of Philosophy, Graduate Program in Applied
Statistics

University of California, Riverside, December 1990
Professor Robert J. Beaver, Chairperson


This research uses a Bradley-Terry paired
comparison model, extending the possible outcomes
from a simple binary response to any point along a
continuum. An integrated mean squared error
approach is used as a basis for parameter
estimation based on an underlying logistic model
and for estimating a function that maps a response
from the real line onto the interval [0,1].
Evaluation of the parameter estimates is performed
using an underlying logistic sampling distribution
with a logistic mapping function. The asymptotic
behavior of the estimates involves the use of
linear combinations of order statistics, also
called L-statistics. Using this approach it is
shown that the parameter estimates are consistent
but slightly biased. In a simulation study,
randomly generated values from a standard logistic
distribution were used as response values in a
paired comparison experiment. Samples for 3, 4,
and 5 treatments with 5, 10, 25, and at times 50
replications for each pairing were considered.
The values for the parameter estimates for these
samples are presented along with the graphs of the
estimated inverse mapping function. Density
smoothing techniques using a kernel estimator were
employed to further explore the shape of the
derivative of the inverse mapping function. This
continuous model was then reformulated into a
Likert scale with five possible responses from any
comparison. An ad hoc method and weighted least
squares were used to determine the discriminating
values for the Likert scale.




TABLE OF CONTENTS


Chapter 1  Introduction and Background

Chapter 2  Basic Model
    2.1  Definitions and Notation
    2.2  Estimation of Parameters (without order)
    2.3  Basic Model with order parameter

Chapter 3  Use of Logistic Distribution in Basic Model
    3.1  Truncated Distribution Approach
        3.1.1  Case I: All Ordered Pairs
        3.1.2  Case II: Non Ordered Pairs
    3.2  Limited Integral Approach
    3.3  Model with an Order Parameter

Chapter 4  Properties of the Minimum Mean Square Error
           Parameter Estimates Involving Order Statistics
    4.1  Properties of $\hat{\mu}_i$ using Inverse Logistic Function
    4.2  Application Using a Logistic Mapping Function
    4.3  Properties of $\hat{\phi}^{-1}$ using Logistic Inverse

Chapter 5  Simulation Study Using Parameter Estimates
    5.1  General Approach
    5.2  Results on $\hat{\mu}_i$
    5.3  Results on $\hat{\phi}^{-1}$ and density estimation procedures

Chapter 6  Application to the Likert Scale
    6.1  Introduction
    6.2  Ad Hoc Determination of $\theta_i$'s
    6.3  Weighted Least Squares Determination of $\theta_i$'s


LIST OF FIGURES


Figure 5.1   Phi Inverse with 3 Treatments and 25 Replications
Figure 5.2   Phi Inverse with 4 Treatments and 25 Replications
Figure 5.3   Phi Inverse with 5 Treatments and 25 Replications
Figure 5.4   Normal Density Smoothing, Bandwidth=0.1, 10 Replications
Figure 5.5   Normal Density Smoothing, Bandwidth=0.2, 10 Replications
Figure 5.6   Normal Density Smoothing, Bandwidth=0.3, 10 Replications
Figure 5.7   Normal Density Smoothing, Bandwidth=0.4, 10 Replications
Figure 5.8   Normal Density Smoothing, Bandwidth=0.5, 10 Replications
Figure 5.9   Normal Density Smoothing, Bandwidth=1.0, 10 Replications
Figure 5.10  Normal Density Smoothing, Bandwidth=0.1, 25 Replications
Figure 5.11  Normal Density Smoothing, Bandwidth=0.2, 25 Replications
Figure 5.12  Normal Density Smoothing, Bandwidth=0.3, 25 Replications
Figure 5.13  Normal Density Smoothing, Bandwidth=0.4, 25 Replications
Figure 5.14  Normal Density Smoothing, Bandwidth=0.5, 25 Replications
Figure 5.15  Normal Density Smoothing, Bandwidth=1.0, 25 Replications
Figure 5.16  Normal Density Smoothing, Bandwidth=0.1, 50 Replications
Figure 5.17  Normal Density Smoothing, Bandwidth=0.2, 50 Replications
Figure 5.18  Normal Density Smoothing, Bandwidth=0.3, 50 Replications
Figure 5.19  Normal Density Smoothing, Bandwidth=0.4, 50 Replications
Figure 5.20  Normal Density Smoothing, Bandwidth=0.5, 50 Replications
Figure 5.21  Normal Density Smoothing, Bandwidth=1.0, 50 Replications


LIST OF TABLES


Table 5.1  Comparison of 3 Items
Table 5.2  Comparison of 4 Items
Table 5.3  Comparison of 5 Items
Table 5.4  Comparison of 5 Items (different preset $\mu_i$'s)
Table 6.1  $\phi^{-1}$ and $\theta$ Estimates for Five Parameters
Table 6.2  Average $\theta_i$ Values
Table 6.3  Theta Values


CHAPTER  1 


INTRODUCTION  AND  BACKGROUND 

Paired comparison experiments arise naturally
in many types of competitions and tournaments, in
sensory comparisons of items observed in pairs,
and in combinations associated with the
development of experimental designs. The method
of paired comparisons is generally used when items
can be judged only subjectively, and it is
impractical to measure the responses on a ratio or
interval scale to determine the preference
ordering. A basic paired comparison experiment
consists of treatments $T_1, \ldots, T_k$
($k \ge 2$) taken in pairs, with $n_{ij}$
independent comparisons of $T_i$ with $T_j$,
$1 \le i < j \le k$. The worths associated with
these treatments are assumed to lie along some
underlying continuum of worth. A judge determines
a preference for one of the two treatments in a
pair, and thereby ranks the treatments or objects.
This determination can be based upon the intensity
of the sensory perception such as taste, feel, or
sight, or upon the closeness of the sensation to a
perceived optimum level. By compiling the results
of all $n_{ij}$ selections, a final ranking of all
treatments can be obtained (Bradley, 1976).

The method of paired comparisons dates back
to at least 1860, when Fechner wrote an account
of judging which of two vessels felt heavier under
a variety of circumstances. Although there were
some objections to the methods he used in
conducting his experiments, Fechner’s model, which
was based on a normal distribution, was very
similar to the one used later by Mosteller in 1951
(David, 1969).

Thurstone (1927), however, was the first to
model paired comparisons using the idea of a
subjective psychological continuum whereby items
could be discriminated and ranked in order of
preference. Thurstone’s law of comparative
judgement can be applied to a range of situations
such as comparisons of physical stimulus
intensities (weight or handwriting specimens),
qualitative comparative judgement (excellence of
papers in an educational setting), or measurement
of psychological values (opinions on a set of
public issues). Each judge employs a ‘discriminal
process’ or method of distinguishing between
subjects or items to determine the excellence or
value of the subjects. The judge’s opinion may
fluctuate from one replication to the next when
judging an item, and this leads to the formation
of a normal distribution along a continuum for
each item being compared. When the difference
between the means of any two stimuli is large, it
is easy to rank order the items. If, however, the
means of the two items’ distributions are close,
it is difficult to distinguish between the two
items. Using this basic structure, Thurstone
presented five different cases and models which
depend on assumptions concerning the variance and
correlation structure among the treatments and
stimuli.

Mosteller (1951) formalized Thurstone’s
contributions by developing a linear model for
paired comparisons. Here, the intrinsic worth or
utility of each treatment, $T_i$, is located along
a continuum at a point $S_i$. The subject receives
a sensation in response to the treatment and can
order the two treatments based on the sensations.
This gave the probability of $T_i$ being selected
over $T_j$ expressed as

$$P(T_i \to T_j) = P[d_{ij} > 0] = \frac{1}{\sqrt{2\pi}} \int_{-(\mu_i - \mu_j)}^{\infty} e^{-y^2/2}\,dy \tag{1.1.1}$$

where $E(d_{ij}) = \mu_i - \mu_j$ and, without loss
of generality, $d_{ij} \sim N(\mu_i - \mu_j, 1)$ and
$1 \le i < j \le k$. This is based on the
assumption that the sensations from a stimulus for
a population of individuals are normally
distributed.

Bradley and Terry (1952) proposed a slightly
different model for paired comparisons of the k
treatments. The nonnegative treatment parameters
$\Pi_1, \Pi_2, \Pi_3, \ldots, \Pi_k$, associated with
the k treatments under investigation, could be
represented on an underlying continuum of worth,
and identified only up to a scale parameter. Each
comparison can be considered as a Bernoulli trial
with

$$P(T_i \to T_j) = \Pi_{ij} = \frac{\Pi_i}{\Pi_i + \Pi_j} \tag{1.1.2}$$

where $\Pi_i \ge 0$ and $\sum_i \Pi_i = 1$ for
$1 \le i < j \le k$. Equivalently this can be
written as

$$P(T_i \to T_j) = \frac{1}{4} \int_{-(\ln \Pi_i - \ln \Pi_j)}^{\infty} \operatorname{sech}^2(y/2)\,dy. \tag{1.1.3}$$

When $\Pi_i = e^{\mu_i}$, the Bradley-Terry model can
be obtained by substituting the logistic density
for the normal density in Thurstone’s model.
Parameters can be estimated by the method of
maximum likelihood, whereby the maximum likelihood
estimates satisfy the system of equations

$$\frac{a_i}{p_i} - \sum_{j(\ne i)} \frac{n_{ij}}{p_i + p_j} = 0 \tag{1.1.4}$$

$i = 1, 2, \ldots, k$, subject to the constraint
$\sum_i \Pi_i = 1$. Here $a_i$ is the number of
selections favoring $T_i$, and $p_i$ is the
estimator for $\Pi_i$ with $\sum_i p_i = 1$.

Bradley and Terry solved these equations
iteratively, and used the likelihood ratio
principle in testing hypotheses. Conditions for
the existence of solutions to (1.1.4) were given
by Ford (1957). For testing the hypothesis of
equal preferences, Beaver (1974) proposed an
alternative test based on a locally asymptotic
most stringent criterion.
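
As a rough illustration of how the likelihood equations (1.1.4) can be solved iteratively, the Python sketch below applies a simple fixed-point update, p_i <- a_i / sum_{j != i} n_ij / (p_i + p_j), followed by renormalization so that the estimates sum to one. This is one common scheme and is not necessarily the procedure Bradley and Terry used. The function names and the layout of the `wins` matrix are assumptions made for this example only.

```python
import numpy as np


def bt_prob(pi_i, pi_j):
    """P(T_i -> T_j) under the Bradley-Terry model (1.1.2)."""
    return pi_i / (pi_i + pi_j)


def fit_bradley_terry(wins, n_iter=500, tol=1e-10):
    """Iteratively solve the likelihood equations (1.1.4).

    wins[i, j] is the number of comparisons in which T_i was preferred
    to T_j (hypothetical input layout, chosen for this sketch).
    """
    wins = np.asarray(wins, dtype=float)
    k = wins.shape[0]
    n = wins + wins.T          # n_ij: comparisons made on the pair (i, j)
    a = wins.sum(axis=1)       # a_i: selections favoring T_i
    p = np.full(k, 1.0 / k)    # start from equal worths
    for _ in range(n_iter):
        denom = np.array([
            sum(n[i, j] / (p[i] + p[j]) for j in range(k) if j != i)
            for i in range(k)
        ])
        p_new = a / denom
        p_new /= p_new.sum()   # enforce the constraint sum(p) = 1
        converged = np.max(np.abs(p_new - p)) < tol
        p = p_new
        if converged:
            break
    return p


# Usage with made-up counts: 3 treatments, 10 comparisons per pair.
example_wins = np.array([[0, 7, 9],
                         [3, 0, 6],
                         [1, 4, 0]])
p_hat = fit_bradley_terry(example_wins)
```

The iteration assumes that every treatment wins at least once and that the comparison data connect all treatments; otherwise the update may not converge to a positive solution.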

In his book on individual choice behavior,
Luce (1959) established a choice axiom using basic
probability principles. This axiom provides a
method for determining the probability that one
item is selected from a fixed set of items. He
defines U as the universal set of possible
alternatives from which a subject must be able to
state a preference for one of the items. Further,
T is a finite subset of U and must contain the
chosen item. The notation $P_T(S)$ denotes the
probability that the selected item lies in the
subset S, where S is a subset of T. $P(x,y)$ is
written to represent $P_{\{x,y\}}(x)$ when
$x \ne y$.

AXIOM 1. Let T be a finite subset of U such
that for every $S \subset T$, $P_S$ is defined.

i)  If $P(x,y) \ne 0, 1$ for all $x, y \in T$, then
for $R \subset S \subset T$,
$P_T(R) = P_S(R)\,P_T(S)$.

ii) If $P(x,y) = 0$ for some $x, y \in T$, then for
every $S \subset T$,
$P_T(S) = P_{T-\{x\}}(S - \{x\})$.
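
Part (i) of the axiom can be checked numerically in the case where choice probabilities are proportional to positive worths, a representation known to be consistent with the axiom. The short Python sketch below is only an illustration: the worth values and the helper name `choice_prob` are hypothetical and do not come from Luce's text.

```python
def choice_prob(worth, subset, universe):
    """P_T(S): probability that the item chosen from T lies in S,
    assuming choice probabilities proportional to positive worths."""
    total = sum(worth[x] for x in universe)
    return sum(worth[x] for x in subset) / total


# Hypothetical worths for four alternatives.
worth = {"a": 4.0, "b": 2.0, "c": 1.0, "d": 1.0}
T = {"a", "b", "c", "d"}
S = {"a", "b", "c"}      # S is a subset of T
R = {"a", "b"}           # R is a subset of S

lhs = choice_prob(worth, R, T)                              # P_T(R)
rhs = choice_prob(worth, R, S) * choice_prob(worth, S, T)   # P_S(R) * P_T(S)
```

Here `lhs` and `rhs` agree up to floating-point rounding, which is the product rule stated in part (i).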




Using the logistic model, Luce goes on to apply
this choice axiom to a variety of psychological
areas. After comparing this axiom with
Thurstone’s discriminal process, Luce concludes
that although the two are “logically distinct”,
the assumptions are “extremely similar for all
practical purposes”.

Following the establishment of these basic
models, theorists studied various other areas.
The effect of ties or a no preference judgement
was considered by Rao and Kupper (1967), who
extended the Bradley-Terry model to allow for the
occurrence of such ties. They introduced a
parameter, $\eta$, interpreted as a threshold of
sensory discrimination for the judge. Whenever the
estimate of the difference in response for two
objects is less than $\eta$, a tie results. When
$\eta = 0$, the model reduces to that of
Bradley-Terry. Davidson (1970) went further in
studying the effect of ties by developing a
parameter that depends only on the net performance
of the judges and not on the performance of an
individual paired comparison.

Another extension of the Bradley-Terry Model
involves the consideration of order effects. In
psychological experiments the effect of order of
presentation may be of more importance than the
actual effects of the stimuli. An additive order
effect parameter was introduced by Beaver and
Gokhale (1975) to account for the effects of order
in the presentation of stimuli. A common
parameter $\delta_{ij}$ for the ordered pairs (i,j)
and (j,i) would inflate the probability of one
item being preferred, while deflating the
probability of the second item being preferred.
Since the order parameter varies from pair to
pair, the number of order effect parameters
becomes quite large, and the model awkward to
analyze.


Later, based on the extended Rao-Kupper and
Davidson models, a multiplicative order effect
model was introduced by Davidson and Beaver
(1977). Since the Bradley-Terry Model uses
parameters linear on the log-scale, it is natural
to consider such a multiplicative effect. They
proposed a parameter $\gamma$, independent of the
individual pairs. When $\gamma = 1$ there is no
order effect, while $\gamma > 1$ inflates the worth
of the second stimulus and $\gamma < 1$ deflates
the worth of the second object. This changes the
probabilities of preference from the original
model to account for the order of presentation of
the items within a pair. The probability that the
first item is preferred when given (i,j) is
expressed as

$$P(i \to j \mid i, j) = \frac{\Pi_i}{\Pi_i + \gamma \Pi_j} \tag{1.1.5}$$

while the probability that the second object is
preferred is given as

$$P(j \to i \mid i, j) = \frac{\gamma \Pi_j}{\Pi_i + \gamma \Pi_j} \tag{1.1.6}$$

for $1 \le i \ne j \le k$.
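
The effect of the multiplicative order parameter can be seen with a small numerical sketch. The Python function below assumes the forms P(i -> j | i,j) = Pi_i / (Pi_i + gamma * Pi_j) and P(j -> i | i,j) = gamma * Pi_j / (Pi_i + gamma * Pi_j) for the two probabilities; the function name and the example worths are hypothetical.

```python
def order_effect_probs(pi_i, pi_j, gamma):
    """Return (P(first item preferred), P(second item preferred)) for the
    ordered pair (i, j) under a multiplicative order effect gamma.

    Assumes the second item's worth is scaled by gamma, so that
    gamma = 1 recovers the ordinary Bradley-Terry probabilities.
    """
    denom = pi_i + gamma * pi_j
    return pi_i / denom, gamma * pi_j / denom


# Hypothetical worths: the first item is somewhat preferred.
no_effect = order_effect_probs(0.6, 0.4, gamma=1.0)
second_favored = order_effect_probs(0.6, 0.4, gamma=2.0)
```

With gamma = 1 the pair of probabilities equals the Bradley-Terry values, and with gamma > 1 the probability for the second-presented item increases at the expense of the first.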

Finally, a model for triple comparisons was
developed in detail, along with the necessary
asymptotic theory, by Pendergrass and Bradley
(1960) and Beaver and Rao (1972). More recent
studies have considered all possible pairs,
including comparison of identical items to help
estimate order effects and tie parameters
(Sirotnik and Beaver, 1984), as well as various
applications to the design of paired comparison
experiments. Such applications include mixture
problems (Charnet and Beaver, 1988) and factorial
paired comparisons in experiments (El-Helbawy and
Ahmed, 1984).

Little or no work has been performed that
considers more than a simple binary response (y=0
or 1) in the Bradley-Terry model. A paper by Tutz
(1986) does extend this model to allow for the
strength of preference, so that a respondent can
decide whether the treatments differ greatly or
only slightly. Tutz uses an ordinal response
paired comparison system with k response
categories, where the parameters
$\theta_1, \ldots, \theta_k$ (determined uniquely)
characterize the discrimination power of the
response categories. Tutz claims that his
generalized Bradley-Terry model is more precise
than the original model. This becomes important
if the value of a future response is to be
predicted.

The present study extends this approach to
develop a general mathematical model relating the
sensory perceptions of judges via an arbitrary
function to a paired comparison model of the
Bradley-Terry type. This unknown function maps a
response from a continuous scale over the real
numbers to a value between 0 and 1. In Chapter 2
the necessary definitions and the basic model that
will be used are introduced. A minimum mean
squared error criterion is used in determining
estimates of the parameters in a reasonably
general context. Specific development of the
estimators of the parameters as they relate to the
logistic distribution is explored in Chapter 3.
Here we consider both a truncated logistic
distribution and a limited integral approach to
the problem. In each of these cases, we examine
the estimates using data obtained from all ordered
pairs (i,j) and (j,i), as well as data that
considers only the pairs (i,j) with
$1 \le i < j \le k$. In Chapter 4 we continue to
consider the properties of the estimators that
involve linear combinations of order statistics,
or L-statistics. Here, we first consider a general
form for the mean of the L-statistics and a bound
on the variance term using the underlying logistic
distribution. Then we provide an example using a
logistic mapping function within the framework of
the logistic distribution. The accuracy of the
parameter estimates is explored in Chapter 5
through simulation, while in Chapter 6 we apply
the results of the study to measurements on the
Likert scale technique.


CHAPTER  2 


DEVELOPMENT  OF  THE  BASIC  MODEL 

2.1  Definitions  and  Notation 

In the development of a model that
generalizes the 0-1 binary response of the
Bradley-Terry Model to one lying in the interval
(0,1), the definitions and notation which follow
are convenient.

Definition 1: Let $U_1, \ldots, U_k$ be random
errors associated with a subject’s perception of
the k items. The variables $U_1, \ldots, U_k$ are
assumed to be independently and identically
distributed, centered at 0, with cumulative
distribution function (cdf) $G(\cdot)$ which may be
known or unknown. The difference $U_i - U_j$ has
cdf F, centered at zero.

Definition 2: The random variable
$V_{ij} = (U_i - U_j) + (\mu_i - \mu_j)$, which is
the perceived difference in the worth of items i
and j, has location parameter $(\mu_i - \mu_j)$ and
cdf F centered at $\mu_i - \mu_j$. The unknown
parameters $\mu_i$ and $\mu_j$ are used to define
the relative worths of the ith and jth items.

Definition 3: The response elicited is
$Y_{ij} = \phi(V_{ij})$, where $\phi$ is a
monotonic, increasing, continuous function that
maps the real numbers onto the interval [0,1].
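
To make Definitions 1 to 3 concrete, the following Python sketch simulates responses for every ordered pair. It is only an illustration under stated assumptions: the difference U_i - U_j is drawn directly from a standard logistic distribution (so F is logistic), the mapping phi is taken to be the logistic cdf, and the function name `simulate_responses` and the worth values are hypothetical.

```python
import numpy as np


def simulate_responses(mu, n, rng):
    """Simulate n responses Y_ij = phi(V_ij) for every ordered pair (i, j).

    Assumptions made for this sketch:
      * U_i - U_j is drawn directly from a standard logistic distribution,
      * phi is the logistic cdf, mapping the real line into (0, 1).
    Returns a dict keyed by the ordered pair (i, j), 0-based.
    """
    k = len(mu)
    responses = {}
    for i in range(k):
        for j in range(k):
            if i == j:
                continue
            d = rng.logistic(loc=0.0, scale=1.0, size=n)   # U_i - U_j
            v = d + (mu[i] - mu[j])                        # V_ij
            responses[(i, j)] = 1.0 / (1.0 + np.exp(-v))   # Y_ij = phi(V_ij)
    return responses


# Usage: three items with hypothetical worths summing to zero.
rng = np.random.default_rng(0)
sample = simulate_responses([0.5, 0.0, -0.5], n=200, rng=rng)
```

Responses for a pair whose first item has the larger worth tend to fall closer to 1, while the reversed ordering tends to give values closer to 0.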

Throughout this study we will use
ordered pairs, so that the pair (i,j) is different
from the pair (j,i), with the first member of the
pair being presented first to a judge,
$1 \le i < j \le k$. The function $F_{ij}(y)$ will
denote the cdf of the random variable $Y_{ij}$, the
judge’s response to the ordered pair (i,j), while
$F_{ji}(y)$ denotes the cdf of the random variable
$Y_{ji}$, the comparison of the pair (j,i).

The estimators of the function $\phi$ and
the unknown parameters $\mu_1, \ldots, \mu_k$ are
determined using a minimum mean squared error
criterion. We begin by observing that the cdf of
$Y_{ij}$ is related to the cdf of $V_{ij}$:

$$\begin{aligned}
F_{ij}(y) = P(Y_{ij} \le y) &= P(\phi(V_{ij}) \le y) \\
&= P(V_{ij} \le \phi^{-1}(y)) \\
&= F[\phi^{-1}(y) - (\mu_i - \mu_j)]
\end{aligned} \tag{2.1.1}$$

where F is the distribution function of
$d_{ij} = U_i - U_j$. Let $F_{ij}^{(n)}(y)$ be the
sample estimate of $F_{ij}(y)$, which is equal to
$F[\phi^{-1}(y) - (\mu_i - \mu_j)]$, the true cdf of
the random variable $Y_{ij}$. For ease of
presentation, we will let $n_{ij} = n$ for all
pairs (i,j), so that there are an equal number of
comparisons for each pair. The estimates of $\phi$
and $\mu = (\mu_1, \mu_2, \ldots, \mu_k)$ are those
which minimize the difference between $F_{ij}(y)$
and $F_{ij}^{(n)}(y)$, which is equivalent to
minimizing the difference between
$F^{-1}(F_{ij}(y))$ and $F^{-1}(F_{ij}^{(n)}(y))$ for
$1 \le i < j \le k$. Thus, we will minimize the
difference between
$F^{-1}(F(\phi^{-1}(y) - (\mu_i - \mu_j)))$ and
$F^{-1}F_{ij}^{(n)}$, or, simplified, between
$\phi^{-1}(y) - (\mu_i - \mu_j)$ and
$F^{-1}F_{ij}^{(n)}$. If we define
$f_{ij}(y) = F^{-1}(F_{ij}^{(n)}(y))$, then the
criterion to be minimized is given by

$$Q = \sum_{i \ne j}^{k} \int_0^1 \left[\phi^{-1}(y) - (\mu_i - \mu_j) - f_{ij}(y)\right]^2 dy \tag{2.1.2}$$

By expanding Q we have


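$$\begin{aligned}
Q = {} & k(k-1)\int_0^1 [\phi^{-1}(y)]^2\,dy
  + \sum_{i \ne j}^{k} (\mu_i - \mu_j)^2
  + \sum_{i \ne j}^{k} \int_0^1 f_{ij}^2(y)\,dy \\
& - 2\sum_{i \ne j}^{k} (\mu_i - \mu_j) \int_0^1 \phi^{-1}(y)\,dy
  - 2\sum_{i \ne j}^{k} \int_0^1 \phi^{-1}(y) f_{ij}(y)\,dy \\
& + 2\sum_{i \ne j}^{k} (\mu_i - \mu_j) \int_0^1 f_{ij}(y)\,dy
\end{aligned}$$

where $\sum_{i \ne j}^{k}$ is interpreted as the
double sum over both i and j excluding terms for
which i=j.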

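Minimizing Q involves taking
derivatives with respect to the unknown
parameters and setting these derivatives equal to
0. Solving the resulting equations provides the
parameter estimates which follow:

$$\frac{\partial Q}{\partial \mu_i}
= 2\sum_{j(\ne i)} (\mu_i - \mu_j)
- 2\sum_{j(\ne i)} (\mu_j - \mu_i)
+ 2\sum_{j(\ne i)} \int_0^1 \left(f_{ij}(y) - f_{ji}(y)\right) dy.$$

Then,

$$0 = \sum_{j(\ne i)} (\mu_i - \mu_j)
- \sum_{j(\ne i)} (\mu_j - \mu_i)
+ \sum_{j(\ne i)} \int_0^1 \left(f_{ij}(y) - f_{ji}(y)\right) dy$$

for $i = 1, 2, \ldots, k$. Using the constraint
$\sum_{j=1}^{k} \mu_j = 0$, this equation can be
written as

$$0 = 2k\mu_i + \sum_{j(\ne i)} \int_0^1 \left(f_{ij}(y) - f_{ji}(y)\right) dy$$

and

$$\hat{\mu}_i = \frac{1}{2k} \sum_{j(\ne i)} \int_0^1 \left[f_{ji}(y) - f_{ij}(y)\right] dy, \tag{2.1.3}$$

$i = 1, 2, \ldots, k$.

In solving for $\phi^{-1}(y)$ we rewrite Q as

$$Q = \int_0^1 \sum_{i \ne j}^{k} \left\{\phi^{-1}(y) - f_{ij}(y) - (\mu_i - \mu_j)\right\}^2 dy,$$

which is minimized whenever the summation is
minimized. This follows from the fact that
$\sum_j (x_j - a)^2$ is minimized for
$a = \bar{x}$. Therefore we consider

$$\hat{\phi}^{-1}(y)
= \frac{1}{k(k-1)} \sum_{i \ne j}^{k} \left[f_{ij}(y) + (\mu_i - \mu_j)\right]
= \frac{1}{k(k-1)} \sum_{i \ne j}^{k} f_{ij}(y), \tag{2.1.4}$$

since the differences $(\mu_i - \mu_j)$ sum to zero
over all ordered pairs.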

Throughout the study the assumption is
made that in the experiment in which we compare
item i with item j, $P(j \to i) = 1 - P(i \to j)$.
This in turn leads to the relationship, which will
prove useful in future results,

$$F^{-1}(F_{ij}(y)) = F^{-1}(1 - F_{ji}(1 - y)). \tag{2.1.5}$$

For each pair (i,j) we have n responses on [0,1].
By using the order statistics and the associated
values of $F_{ij}^{(n)}(y)$, the integral in 2.1.3
simply becomes the area under a step function with
steps of 1/n at the values of the order statistics
$Y^{ij}_{1:n}, Y^{ij}_{2:n}, \ldots, Y^{ij}_{n:n}$.
This leads to

$$\begin{aligned}
\int_0^1 F^{-1}(F_{ij}^{(n)}(y))\,dy = {} & F^{-1}(0)\,[Y^{ij}_{1:n} - 0]
  + F^{-1}(1/n)\,[Y^{ij}_{2:n} - Y^{ij}_{1:n}] \\
& + F^{-1}(2/n)\,[Y^{ij}_{3:n} - Y^{ij}_{2:n}] + \cdots \\
& + F^{-1}\!\left(\tfrac{n-1}{n}\right)[Y^{ij}_{n:n} - Y^{ij}_{n-1:n}]
  + F^{-1}(n/n)\,[1 - Y^{ij}_{n:n}].
\end{aligned} \tag{2.1.6}$$
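
The step-function area in (2.1.6) and the resulting estimate in (2.1.3) can be sketched in a few lines of Python. This is a minimal illustration, not the dissertation's implementation. In particular, F is assumed to be the standard logistic cdf, and because its inverse is infinite at 0 and 1, the levels r/n are clipped to [eps, 1 - eps]; this clipping is a device introduced here for the sketch. The names `step_integral`, `estimate_mu`, and `eps` are hypothetical.

```python
import numpy as np


def logit(p):
    """Inverse of the standard logistic cdf."""
    return np.log(p / (1.0 - p))


def step_integral(y, finv=logit, eps=1e-3):
    """Area under F^{-1}(F_n(y)) on [0, 1], as in (2.1.6).

    y    : the n responses for one ordered pair, each in [0, 1]
    finv : inverse cdf F^{-1} (assumed logistic by default)
    eps  : the levels r/n are clipped to [eps, 1 - eps]; this is an
           assumption of the sketch that keeps F^{-1}(0), F^{-1}(1) finite
    """
    y = np.sort(np.asarray(y, dtype=float))
    n = y.size
    knots = np.concatenate(([0.0], y, [1.0]))   # 0, Y_(1), ..., Y_(n), 1
    widths = np.diff(knots)                     # the n + 1 interval widths
    levels = np.clip(np.arange(n + 1) / n, eps, 1.0 - eps)
    return float(np.sum(finv(levels) * widths))


def estimate_mu(responses, k, **kwargs):
    """mu_hat_i = (1/2k) * sum over j != i of [I_ji - I_ij], following
    (2.1.3), where I_ij is the step integral for the ordered pair (i, j)."""
    mu_hat = np.zeros(k)
    for i in range(k):
        for j in range(k):
            if j != i:
                mu_hat[i] += (step_integral(responses[(j, i)], **kwargs)
                              - step_integral(responses[(i, j)], **kwargs))
    return mu_hat / (2 * k)
```

By construction the estimates sum to zero, matching the constraint used in deriving (2.1.3). The choice of `eps` affects only the two extreme steps of the function.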


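2.2 Estimation of Parameters (without order)

Using the form given in the previous
section, the estimators now appear as

$$\begin{aligned}
\hat{\mu}_i = \frac{1}{2k} \sum_{j(\ne i)} \Big\{ & F^{-1}(0)\,[Y^{ji}_{1:n} - 0]
  + F^{-1}(1/n)\,[Y^{ji}_{2:n} - Y^{ji}_{1:n}] + \cdots \\
& + F^{-1}\!\left(\tfrac{n-1}{n}\right)[Y^{ji}_{n:n} - Y^{ji}_{n-1:n}]
  + F^{-1}(1)\,[1 - Y^{ji}_{n:n}] \\
& - F^{-1}(0)\,[Y^{ij}_{1:n} - 0] - \cdots - F^{-1}(1)\,[1 - Y^{ij}_{n:n}] \Big\}.
\end{aligned}$$

The estimates of $\mu_1, \ldots, \mu_k$ and
$\phi(\cdot)$ are based on the assumption that both
the pair (i,j) and the pair (j,i) are sampled and
both sets of data are available. If, however, the
sampling is done only on (i,j), the model is taken
to be reflective. In other words, we will employ
the reasoning that $\phi^{-1}(u) = -\phi^{-1}(1-u)$
and therefore that
$\phi^{-1}(y_{ji}) = -\phi^{-1}(1 - y_{ji})$. Recall
from 2.1.5 that, in addition,
$F^{-1}(F_{ij}(y)) = F^{-1}(1 - F_{ji}(1-y))$.

For this reflective case then, where
the data is used twice (once for $y_{ij}$ and once
for $y_{ji} = 1 - y_{ij}$), the parameter estimates
become:

$$\begin{aligned}
\hat{\mu}_i = \frac{1}{2k} \sum_{j(\ne i)} \Big\{ & F^{-1}(0)\,[1 - Y^{ij}_{n:n}]
  + F^{-1}(1/n)\,[1 - Y^{ij}_{n-1:n} - (1 - Y^{ij}_{n:n})] + \cdots \\
& + F^{-1}\!\left(\tfrac{n-1}{n}\right)[1 - Y^{ij}_{1:n} - (1 - Y^{ij}_{2:n})]
  + F^{-1}(1)\,[1 - (1 - Y^{ij}_{1:n})] \\
& - F^{-1}(0)\,[Y^{ij}_{1:n} - 0] - \cdots
  - F^{-1}\!\left(\tfrac{n-1}{n}\right)[Y^{ij}_{n:n} - Y^{ij}_{n-1:n}]
  - F^{-1}(1)\,[1 - Y^{ij}_{n:n}] \Big\} \\
= \frac{1}{2k} \sum_{j(\ne i)} \Big\{ & F^{-1}(0) - F^{-1}(1) \\
& + Y^{ij}_{1:n}\left[F^{-1}(1) - F^{-1}\!\left(\tfrac{n-1}{n}\right) - F^{-1}(0) + F^{-1}\!\left(\tfrac{1}{n}\right)\right] \\
& + Y^{ij}_{2:n}\left[F^{-1}\!\left(\tfrac{n-1}{n}\right) - F^{-1}\!\left(\tfrac{n-2}{n}\right) + F^{-1}\!\left(\tfrac{2}{n}\right) - F^{-1}\!\left(\tfrac{1}{n}\right)\right] + \cdots \\
& + Y^{ij}_{n:n}\left[F^{-1}\!\left(\tfrac{1}{n}\right) - F^{-1}(0) + F^{-1}(1) - F^{-1}\!\left(\tfrac{n-1}{n}\right)\right] \Big\},
\end{aligned}$$

$$\hat{\phi}^{-1}(y) = \frac{\sum_{i \ne j}^{k} F^{-1}(F_{ij})(y)}{k(k-1)}. \tag{2.2.2}$$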


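2.3 Basic model with order parameter

With an order effect ($\gamma_{ij}$) added to
the model (see Davidson and Beaver), we return to
the minimum mean square error criterion to
determine the parameter estimates for $\mu_i$,
$\phi^{-1}(y)$ and $\gamma_{ij}$.

$$Q = \sum_{i \ne j}^{k} \int_0^1 \left\{\phi^{-1}(y) - (\mu_i - \mu_j + \gamma_{ij}) - F^{-1}(F_{ij})(y)\right\}^2 dy \tag{2.3.1}$$

Expanding (2.3.1) leads to

$$\begin{aligned}
Q = {} & k(k-1)\int_0^1 [\phi^{-1}(y)]^2\,dy
  + \sum_{i \ne j}^{k} (\mu_i - \mu_j + \gamma_{ij})^2
  + \sum_{i \ne j}^{k} \int_0^1 f_{ij}^2(y)\,dy \\
& - 2\sum_{i \ne j}^{k} (\mu_i - \mu_j + \gamma_{ij}) \int_0^1 \phi^{-1}(y)\,dy
  - 2\sum_{i \ne j}^{k} \int_0^1 \phi^{-1}(y) f_{ij}(y)\,dy \\
& + 2\sum_{i \ne j}^{k} (\mu_i - \mu_j + \gamma_{ij}) \int_0^1 f_{ij}(y)\,dy.
\end{aligned}$$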


Again, $\sum_{i \ne j}^{k}$ represents the double
sum over both i and j, while $\sum_{j(\ne i)}$
represents the sum over j alone. Since the
parameters $\mu_1, \ldots, \mu_k$ can be estimated
only up to a scale parameter, we also use the
condition again that $\sum_{j=1}^{k} \mu_j = 0$. For
this study we also use the condition that
$\gamma_{ij} = -\gamma_{ji}$. In minimizing Q, we
find

$$\frac{\partial Q}{\partial \mu_i}
= 2\sum_{j(\ne i)} (\mu_i - \mu_j + \gamma_{ij})
- 2\sum_{j(\ne i)} (\mu_j - \mu_i - \gamma_{ij})
+ 2\sum_{j(\ne i)} \int_0^1 \left(f_{ij}(y) - f_{ji}(y)\right) dy,$$

which, when set equal to zero, leads to the
estimate for $\mu_i$ given as:

$$\hat{\mu}_i = \frac{1}{2k} \left[ -2\sum_{j(\ne i)} \gamma_{ij}
+ \sum_{j(\ne i)} \int_0^1 \left(f_{ji}(y) - f_{ij}(y)\right) dy \right], \tag{2.3.2}$$

$i = 1, 2, \ldots, k$.

The estimate for $\phi^{-1}(y)$ is identical to that of the basic model (2.1.4) when we assume that $\gamma_{ij} = -\gamma_{ji}$, because these terms cancel from the equation.

The estimate for $\gamma_{ij}$ is found as the solution to the equation

\[
\frac{\partial Q}{\partial \gamma_{ij}} = -4(\mu_j - \mu_i - \gamma_{ij}) + 2\Big[ \int_0^1 f_{ij}(y)\,dy - \int_0^1 f_{ji}(y)\,dy \Big] = 0,
\]

from which we obtain

\[
\hat{\gamma}_{ij} = -\delta_{ij} + \frac{1}{2}\int_0^1 \big(f_{ji}(y) - f_{ij}(y)\big)\,dy \tag{2.3.3}
\]


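with $\delta_{ij} = \mu_i - \mu_j$, $1 \le i, j \le k$. For the design in which all ordered pairs are used, the estimates for $\mu_i$ and $\gamma_{ij}$ can be written as:

\[
\begin{aligned}
\hat{\mu}_i = \frac{1}{2k}\Big[ -2\sum_{j(\neq i)} \gamma_{ij} + \sum_{j(\neq i)} \Big\{ & F^{-1}(0)\,[Y^{ji}_{1:n} - 0] + F^{-1}(\tfrac{1}{n})\,[Y^{ji}_{2:n} - Y^{ji}_{1:n}] + \cdots + F^{-1}(1)\,[1 - Y^{ji}_{n:n}] \\
& - F^{-1}(0)\,[Y^{ij}_{1:n} - 0] - \cdots - F^{-1}(1)\,[1 - Y^{ij}_{n:n}] \Big\} \Big]
\end{aligned} \tag{2.3.4}
\]

\[
\begin{aligned}
\hat{\gamma}_{ij} = -\delta_{ij} + \frac{1}{2}\Big( \Big\{ & F^{-1}(0)\,[Y^{ji}_{1:n} - 0] + F^{-1}(\tfrac{1}{n})\,[Y^{ji}_{2:n} - Y^{ji}_{1:n}] + \cdots + F^{-1}(1)\,[1 - Y^{ji}_{n:n}] \\
& - F^{-1}(0)\,[Y^{ij}_{1:n} - 0] - \cdots - F^{-1}(1)\,[1 - Y^{ij}_{n:n}] \Big\} \Big)
\end{aligned} \tag{2.3.5}
\]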


In  the  case  of  distinct  pairs,  the  estimators  in 
2.3.4  and  2.3.5  can  be  simplified  somewhat  (see 
2.2.2).  The  properties  of  these  estimators  will 
be  addressed  in  Chapter  3,  as  they  pertain  to 
specific  distributions. 




CHAPTER  3 


USE  OF  LOGISTIC  DISTRIBUTION  IN  BASIC  MODEL 

3.1  Truncated Distribution Approach

The logistic distribution is commonly used in the study of paired comparison experiments. In its simplest representation, the logistic distribution is given as

\[
F(x) = \frac{1}{1 + e^{-x}} \quad \text{for } x \in \mathbb{R} \tag{3.1.1}
\]

with density

\[
f(x) = \frac{e^{-x}}{(1 + e^{-x})^2} = F(x)\,(1 - F(x)). \tag{3.1.2}
\]

In order to obtain finite values for all terms in the parameter estimates given in equations 2.3.1 and 2.2.2, it is necessary to limit the range over which the logistic random variable is defined. One method involves truncating the distribution; a second method, discussed in Section 3.2, involves limiting the range of integration which appears in the estimates. Without some type of truncation or censoring, the inverse mapping will involve $F^{-1}(0) = -\infty$ and $F^{-1}(1) = +\infty$, and hence give rise to a divergent integral. The symmetrically truncated standard logistic distribution takes the form

\[
F(x) = \frac{\dfrac{1}{1 + e^{-x}} - \dfrac{1}{1 + e^{\alpha}}}{\dfrac{1}{1 + e^{-\alpha}} - \dfrac{1}{1 + e^{\alpha}}}, \qquad -\alpha < x < \alpha < \infty \tag{3.1.3}
\]

where $\alpha$ is the upper truncation point and $-\alpha$ is the lower truncation point. We assume that $\alpha$ is a known parameter. If $\alpha$ is not known, for all practical purposes one can take $\alpha$ to be a large number, say 10, so that the interval $(-\alpha, \alpha)$ contains more than 99% of the distribution. The truncation occurs for the variable $v_{ij}$, so that the domain $-\infty < v_{ij} < +\infty$ is replaced by $-\alpha < v_{ij} < \alpha$, which then implies that the range of $U_i - U_j$ is given by $-\alpha - \delta_{ij} < U_i - U_j < \alpha - \delta_{ij}$.


3.1.1  CASE  I:  ALL  ORDERED  PAIRS 

Case I is used to denote the paired comparison experiment involving all ordered pairs, so that data is collected on all possible combinations of treatments $(i,j)$ and $(j,i)$, $1 \le i, j \le k$. This sampling design allows for estimating order effects that might be present in the data. Case II, in which data is collected using only distinct pairs $(i,j)$, $1 \le i < j \le k$, and then used again as data for the pair $(j,i)$ by a simple reflection technique, is discussed in Section 3.1.2. The inverse function, $F^{-1}(\cdot)$, is needed to estimate the parameters in the basic model given by equations 2.2.1 and 2.2.2. To solve for this inverse function we observe that $V_{ij} = \phi^{-1}(Y_{ij})$. Let $p = F(y)$ and solve for $y$ as follows:


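\[
p = \frac{\dfrac{1}{1 + e^{-y}} - \dfrac{1}{1 + e^{\alpha}}}{\dfrac{1}{1 + e^{-\alpha}} - \dfrac{1}{1 + e^{\alpha}}}
\]

\[
\frac{p\,(e^{\alpha} - e^{-\alpha})}{(1 + e^{\alpha})(1 + e^{-\alpha})} = \frac{1}{1 + e^{-y}} - \frac{1}{1 + e^{\alpha}}
\]

\[
\frac{p\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}{(1 + e^{\alpha})(1 + e^{-\alpha})} = \frac{1}{1 + e^{-y}}
\]

\[
1 + e^{-y} = \frac{(1 + e^{\alpha})(1 + e^{-\alpha})}{p\,(e^{\alpha} - e^{-\alpha}) + (1 + e^{-\alpha})}
\]

\[
e^{-y} = \frac{(1 + e^{-\alpha})\,e^{\alpha} - p\,(e^{\alpha} - e^{-\alpha})}{p\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}
\]

\[
y = \ln \frac{p\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}{(1 + e^{-\alpha})\,e^{\alpha} - p\,(e^{\alpha} - e^{-\alpha})} \tag{3.1.4}
\]

and

\[
F^{-1}(p) = \ln \frac{(1 + e^{-\alpha})\,e^{\alpha} - (1-p)\,(e^{\alpha} - e^{-\alpha})}{(1-p)\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}} \tag{3.1.5}
\]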
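As a quick numerical illustration of the truncated logistic distribution and its inverse, the following sketch evaluates the cdf in (3.1.3) and the closed-form inverse in (3.1.4). The function names are chosen here for illustration and are not from the original text; the check simply confirms that the inverse returns the truncation points at $p = 0$ and $p = 1$ and undoes the cdf in between.

```python
import math


def trunc_logistic_cdf(x, alpha):
    """Symmetrically truncated standard logistic cdf on (-alpha, alpha)."""
    lower = 1.0 / (1.0 + math.exp(alpha))    # untruncated cdf at -alpha
    upper = 1.0 / (1.0 + math.exp(-alpha))   # untruncated cdf at +alpha
    return (1.0 / (1.0 + math.exp(-x)) - lower) / (upper - lower)


def trunc_logistic_inv(p, alpha):
    """Closed-form inverse of the truncated cdf for 0 <= p <= 1."""
    d = math.exp(alpha) - math.exp(-alpha)
    numerator = p * d + 1.0 + math.exp(-alpha)
    denominator = (1.0 + math.exp(-alpha)) * math.exp(alpha) - p * d
    return math.log(numerator / denominator)


def untruncated_mass(alpha):
    """Probability that a standard logistic variable falls in (-alpha, alpha)."""
    return 1.0 - 2.0 / (1.0 + math.exp(alpha))
```

With $\alpha = 10$, `untruncated_mass(10.0)` exceeds 0.99, which is consistent with the remark that a large truncation point discards very little of the distribution.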


With the general form for $\hat{\mu}_i$ developed in equation (2.2.1) combined with the solution in equation 3.1.6, using the logistic distribution we have:


2k£,  =  £  jF_1  C1)"*'"1  C1)  + 

J  (#i)^ 


Yr:n(-«-(ln 


n ( 1 +e-a) e“— (n-1)  (e“-e'°) 
(n-l)(e“-e-“)+n(l+e-°)  + 


>) 


+YJ\„|  ln-U'-  . 


n(l+e  °)  ea  —  e^  +  e-0  (n-2)  (ea-e  “) +n  (1+e  "“) 


ea-e-<5!  +  n(  l+e-a:)  'n(l+e  “)  e°— (n-2)  (e“-e  a) 


+  .  .  .+  Yn  ;  n  ln^ 


n(l+e  °)e0— e°+e~a 


e“-e  a+n(l+e  °) 


+  a 


,  v/1;  (  ,  ,  ea  — e-“  +  n(l+e°) 

+  Yi:n(a+  1  nn  (  l+e~a)  ea  — e"  +  e-c 


2(e“-e-°)  +  n(l+e~°)  n  ( l+e"a)  e°-ea  +  -ae 
2:nl  n  ( 1+e-01)  — 2(ea-e~“)*  ea-e  "a  +  n  ( l+e~Q) 


+  In 


+  •  •  • Yn : n (  a+  1 n 


n(l+e~g)ea— (n-1) (e°-e~°) 
(n-1)  (e°,-e-a)  +  n  ( 1+e  ~a) 


(3.1.6) . 

With  F-1(l)=a  the  above  can  be  expressed  in  a 
summarized  form  as  follows: 




(3.1.8) 

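Using the estimator given in equation 2.1.3, we can investigate $\lim_{n \to \infty} \hat{\mu}_i$ by examining the behavior of

\[
\int_0^1 f_{ij}(y)\,dy = \int_0^1 F^{-1}\big(F^{(n)}_{ij}(y)\big)\,dy.
\]

Now, $F^{(n)}_{ij}(y) \to F_{ij}(y)$ uniformly in $y$, so that for $F_{ij}(y) = F(\phi^{-1}(y) - (\mu_i - \mu_j))$ we have

\[
\int_0^1 F^{-1}\big(F^{(n)}_{ij}(y)\big)\,dy \to \int_0^1 \big(\phi^{-1}(y) - (\mu_i - \mu_j)\big)\,dy.
\]

Since $\phi(y)$ is monotonic, increasing, and skew-symmetric at $y = \tfrac{1}{2}$, that is $\phi^{-1}(y) = -\phi^{-1}(1-y)$, the integral of $\phi^{-1}(y)$ over $[0,1]$ vanishes. Hence,

\[
\int_0^1 \big(\phi^{-1}(y) - (\mu_i - \mu_j)\big)\,dy = -(\mu_i - \mu_j).
\]

This leads us to

\[
\begin{aligned}
\lim_{n \to \infty} \hat{\mu}_i &= \frac{1}{2k} \sum_{j(\neq i)} \int_0^1 \big(f_{ji}(y) - f_{ij}(y)\big)\,dy \\
&= \frac{1}{2k} \sum_{j=1}^{k} \big(-(\mu_j - \mu_i) + (\mu_i - \mu_j)\big) \\
&= \frac{1}{2k}\,2k\mu_i \\
&= \mu_i.
\end{aligned}
\]

We can therefore conclude that, asymptotically, $\hat{\mu}_i$ is a consistent estimator of $\mu_i$.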

By considering the expectation of $\hat{\mu}_i$ we can determine if this estimate is unbiased. After taking the expectations of both sides of equation 3.1.8 we have:


£(/*:)  =  A  E  £  {cn-mE(Y^n)+cmE(Y^n)} 
2kj(^i)m=2  1  J 

(3.1.9) 


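Since the expectations of order statistics are approximated by quantiles, whereby $E(Y^{(n)}_{r:n}) \approx F_{ij}^{-1}\big(\tfrac{r}{n+1}\big)$, we can solve for these quantiles with $F(x) = p$, as shown below:

\[
p = \frac{\dfrac{1}{1 + e^{-x}} - \dfrac{1}{1 + e^{\alpha}}}{\dfrac{1}{1 + e^{-\alpha}} - \dfrac{1}{1 + e^{\alpha}}}
\]

\[
x = \ln \frac{p\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}{(1 + e^{-\alpha})\,e^{\alpha} - p\,(e^{\alpha} - e^{-\alpha})}
\]

which then gives, for the value of the $r$th order statistic:

\[
E(Y_{r:n}) \approx \ln \frac{\tfrac{r}{n+1}\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}{(1 + e^{-\alpha})\,e^{\alpha} - \tfrac{r}{n+1}\,(e^{\alpha} - e^{-\alpha})} \tag{3.1.10}
\]

This approximation depends upon $\phi$, which is unspecified, so we defer the discussion of the expectation until we consider linear combinations of order statistics in Chapter 4.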

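The estimate of $\phi^{-1}(y)$ for the truncated logistic function in Case I appears in quite a different form. Recall from Chapter 2 (2.2.1) that

\[
k(k-1)\,\hat{\phi}^{-1}(y) = \sum_{i<j}^{k} \Big\{ F^{-1}\big(F^{(n)}_{ij}(y)\big) + F^{-1}\big(F^{(n)}_{ji}(y)\big) \Big\} \tag{3.1.11}
\]

The sample cumulative distribution functions are given by

\[
F^{(n)}_{ij}(y) = \frac{1}{n} \sum_{l=1}^{n} I(Y_{ijl}, y)
\]

and

\[
F^{(n)}_{ji}(y) = \frac{1}{n} \sum_{l=1}^{n} I(Y_{jil}, y) \tag{3.1.12}
\]

where the indicator function is defined as

\[
I(a,b) = \begin{cases} 1 & \text{if } a \le b \\ 0 & \text{if } a > b. \end{cases}
\]

Using this definition, we have $\hat{\phi}^{-1}(y)$ defined as follows:

\[
\begin{aligned}
k(k-1)\,\hat{\phi}^{-1}(y) = \sum_{i<j} \Big\{ & \ln \frac{F^{(n)}_{ij}(y)\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}}{(1 + e^{-\alpha})\,e^{\alpha} - F^{(n)}_{ij}(y)\,(e^{\alpha} - e^{-\alpha})} \\
& + \ln \frac{(1 + e^{-\alpha})\,e^{\alpha} - (1 - F^{(n)}_{ji}(y))\,(e^{\alpha} - e^{-\alpha})}{(1 - F^{(n)}_{ji}(y))\,(e^{\alpha} - e^{-\alpha}) + 1 + e^{-\alpha}} \Big\}
\end{aligned} \tag{3.1.13}
\]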

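Since

\[
F_{ji}(y) = 1 - F\big(\phi^{-1}(1-y) - (\mu_i - \mu_j)\big) = 1 - F_{ij}(1-y)
\]

and

\[
F_{ij}(y) = F\big(\phi^{-1}(y) - (\mu_i - \mu_j)\big),
\]

where $F(x) = \dfrac{1}{1 + e^{-x}}$, we can consider the consistency of $\hat{\phi}^{-1}(y)$ as $n \to \infty$. Using $F^{(n)}_{ij}(y)$ to estimate $F_{ij}(y)$ we have from equation 2.1.5 that

\[
F^{(n)}_{ji}(y) = 1 - F^{(n)}_{ij}(1-y).
\]

As $n \to \infty$,

\[
F^{-1}\big(F^{(n)}_{ij}(y)\big) \to F^{-1}\big(F_{ij}(y)\big) = F^{-1}F\big(\phi^{-1}(y) - \delta_{ij}\big) = \phi^{-1}(y) - \delta_{ij}.
\]

Similarly, as $n \to \infty$,

\[
\begin{aligned}
F^{-1}\big(F^{(n)}_{ji}(y)\big) = F^{-1}\big(1 - F^{(n)}_{ij}(1-y)\big) &\to F^{-1}\big(1 - F_{ij}(1-y)\big) \\
&= F^{-1}\big\{1 - F\big(\phi^{-1}(1-y) - \delta_{ij}\big)\big\} \\
&= F^{-1}\Big\{1 - \frac{1}{1 + e^{\phi^{-1}(y) + \delta_{ij}}}\Big\} \\
&= F^{-1}\Big\{\frac{1}{1 + e^{-(\phi^{-1}(y) - \delta_{ji})}}\Big\} \\
&= F^{-1}\big(F(\phi^{-1}(y) - \delta_{ji})\big) \\
&= \phi^{-1}(y) - \delta_{ji}.
\end{aligned}
\]

Substituting these into 3.1.11 gives, as $n \to \infty$,

\[
k(k-1)\,\hat{\phi}^{-1}(y) \to \frac{k(k-1)}{2}\,2\phi^{-1}(y),
\]

which implies that $\hat{\phi}^{-1}(y) \to \phi^{-1}(y)$ for fixed $y$. Hence,

\[
\lim_{n \to \infty} \hat{\phi}^{-1}(y) = \frac{1}{k(k-1)}\,k(k-1)\,\phi^{-1}(y) = \phi^{-1}(y) \quad \forall\, y, \tag{3.1.14}
\]

showing that $\hat{\phi}^{-1}(y)$ is a consistent estimator.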
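The key step in the consistency argument is the identity $F^{-1}(1 - F(\phi^{-1}(1-y) - \delta_{ij})) = \phi^{-1}(y) - \delta_{ji}$, which relies on the symmetry of the logistic $F$ and the skew-symmetry of $\phi^{-1}$. The sketch below checks it numerically for two skew-symmetric choices of $\phi^{-1}$; both choices and all function names are assumptions made for illustration only.

```python
import math


def logistic_cdf(x):
    return 1.0 / (1.0 + math.exp(-x))


def logistic_inv(p):
    return math.log(p / (1.0 - p))


def phi_inv_logit(y):
    """Skew-symmetric mapping: logit."""
    return logistic_inv(y)


def phi_inv_cauchy(y):
    """Another skew-symmetric mapping: the standard Cauchy quantile function."""
    return math.tan(math.pi * (y - 0.5))


def limit_for_reversed_pair(y, delta_ij, phi_inv):
    """F^{-1}(1 - F_ij(1 - y)) with F_ij(y) = F(phi_inv(y) - delta_ij)."""
    f_ij_reflected = logistic_cdf(phi_inv(1.0 - y) - delta_ij)
    return logistic_inv(1.0 - f_ij_reflected)
```

Since $\delta_{ji} = -\delta_{ij}$, the returned value should equal `phi_inv(y) + delta_ij` for every $y$ in $(0,1)$.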


3.1.2  CASE II: NON-ORDERED PAIRS

At  times  it  may  be  inconvenient  or 
impossible  to  collect  the  ordered  pairs  (i,j)  and 
(j,i).  This  gives  rise  to  the  situation  where  we 
only consider the pairs $(i,j)$ with $1 \le i < j \le k$. This alters the minimum mean squared error equation as follows:

\[
Q = \sum_{i<j}^{k} \Big\{ \int_0^1 \big(\phi^{-1}(y) - (\mu_i - \mu_j) - F^{-1}(F^{(n)}_{ij}(y))\big)^2\,dy \Big\} \tag{3.1.15}
\]


This,  in  turn,  leads  to  different  solutions  for 
the  parameter  estimators.  First,  consider  the 
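estimator of $\phi^{-1}(y)$ which minimizes:

\[
Q = \sum\sum_{i<j} \int_0^1 \big(\phi^{-1}(y) - (\mu_i - \mu_j) - F^{-1}(F^{(n)}_{ij}(y))\big)^2\,dy,
\]

whereby $Q$ is minimized when

\[
\sum\sum_{i<j} \big\{\phi^{-1}(y) - (\mu_i - \mu_j) - F^{-1}(F^{(n)}_{ij}(y))\big\}^2
\]

is minimized.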


As  in  2.1.4,  we  obtain  the  following: 


Mk-i) 


^  *-'M  =  ££(-1  -«j)+  ££  F-‘(P<f(y)) 
i<  j  i<  j 


<T  (y)  = 


k(k-i) 


{  Eo-1)^  -  E^i_1^i+ 

V  i=i  i=i 

tt  F”‘(Fij)(y))  } 

i<  j  J 

"<F)-crfei5{^i<k-ai+‘)'i  +^£F‘,o-Fij)o-y))} 


(3.1.16). 


Solving  for  /f •  leads  to  the  equation 


1  1 

+  2 (  i  —  1 )  Ji~l(y)dy  -  2(k-i)Jj>-1(y)dy 

o  o 

J  _  j  1  1 

-  2E  /  F_1(Fji)(y))  dy  +  2^  /F-1(F[")(y))dy  =  0 


J=1  0 


or 


i  -l 


0  = 


-  E^j  +  (  i  - 1 )  A  i  +  (k-i)Ai 

j=i 

-  (2i-k-l)  y^-1(y)dy  ~  J  F"1  (y )  )  dy 

o  j=i  o 

+  E  /  ^‘(Pifcv))^ 


which  leads  to  the  solution 


A'i  =  (k+l -2 i ) ^-1  (y)  +  £  F_1(Fj”)(y)  )  - 

j=i 

E  F"1  (F j j }(y) )  (3.1.17) 

j  =  i+i 

Using  the  estimate  for  given  in  3.1.17  in 

equation  3.1.16  results  in  the  estimate  for  4>~l 
wh i ch  f o 1  lows : 


<t>  (y)  = 


k(k-l)  (l>-  2i+l)(^)((k-2i+l)^-1(y) 


+ 


E  F"l(Fji}(y))-E  f_1(f [ j }(y) ) )  + 

j=i  j  =  i+i 




(3.1.18). 


tt  F"‘(F[f  (y))l 

i<  j  > 

After  considerable  simplification  the  following 
estimator  for  ^-1  can  be  obtained 


4>  :(y)  = 


k-2 


_ 6 _ /y'fkz2i  +  l  V4  r-'fF^fy))  - 

(5k+2)(k-l)^\  k  jL, 

^  £  F-(F^>(y))}  +  tt  F-'(F<f  (y))) 

j=i  J  i<  j  ' 


(3.1. 19) 

Equations 3.1.16 and 3.1.19 will involve an iterative procedure in order to obtain actual values for $\hat{\mu}_i$ and $\hat{\phi}^{-1}$. Because sampling both the pairs $(i,j)$ and $(j,i)$ provides a precise estimate without double use of the data, it is preferred to this latter situation. The remainder of the research is concerned with the case in which data is collected using both ordered pairs $(i,j)$ and $(j,i)$.


3.2  Limited Integral Approach


An alternative method for handling the logistic distribution, in order to obtain finite expressions for all terms of the parameter estimates, is to limit the range of the integral in equation 2.1.3. Recall that, as mentioned in Section 3.1, the inverse mapping involves $F^{-1}(0) = -\infty$ and $F^{-1}(1) = \infty$. Instead of assuming all values in the interval $[0,1]$, we use values in the interval $[\epsilon, 1-\epsilon]$, where $\epsilon > 0$ is some small number. This technique bounds the distribution away from the endpoints that lead to $+\infty$ and $-\infty$.

The limited integral approach is quite similar to the development in Section 3.1, in which we developed specific parameter estimators for $\hat{\phi}^{-1}(y)$ and $\hat{\mu}_i$ in the logistic distribution. However, this development is slightly simplified. The parameter estimates are again based on those given in 2.2.1. Here, we will consider the situation whereby the ordered pairs $(i,j)$ and $(j,i)$ are sampled. For $0 < p < 1$, the inverse of the function, $F(y) = p$, is given by:

\[
F(y) = \frac{1}{1 + e^{-y}} = p
\]

or

\[
y = F^{-1}(p) = \ln \frac{p}{1-p} \tag{3.2.1}
\]


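This gives then, for the parameter estimates

\[
\hat{\mu}_i = \frac{1}{2k} \sum_{j(\neq i)} \int_{\epsilon}^{1-\epsilon} [f_{ji}(y) - f_{ij}(y)]\,dy \tag{3.2.2}
\]

where $\hat{\phi}^{-1}(y)$ is the same as in Section 3.1. A brief development of the estimator $\hat{\mu}_i$, $i = 1, 2, \ldots, k$, follows; it is almost identical in procedure to the one given in Section 3.1, in which infinite values for $F^{-1}(0)$ and $F^{-1}(1)$ were avoided by using a truncated logistic distribution.

\[
\begin{aligned}
\hat{\mu}_i = \frac{1}{2k} \sum_{j(\neq i)} \Big\{ & F^{-1}(\epsilon)\,[Y^{ji}_{1:n} - \epsilon] + F^{-1}(\tfrac{1}{n})\,[Y^{ji}_{2:n} - Y^{ji}_{1:n}] + \cdots + F^{-1}(1-\epsilon)\,[(1-\epsilon) - Y^{ji}_{n:n}] \\
& - F^{-1}(\epsilon)\,[Y^{ij}_{1:n} - \epsilon] - \cdots - F^{-1}(1-\epsilon)\,[(1-\epsilon) - Y^{ij}_{n:n}] \Big\}
\end{aligned}
\]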


*i=  2R  E  {<~f)lniT7  +  Y^nCln  £-Trr^] 




Y-"[ln2$f?0]  +"-+  Yn-l,n[l"2flrni]  + 

Viiinfln^j^]  +  (1-0  (lnlfi)  +  <  ln^  + 

Yi‘-[prff)i]  +  Y-J"[lnafer]  +---  + 

Yn--  =  n[l”2feri]  +  vi-ln  -  (l-O(ln^)j 


After  combining  like  terms  the  estimator  for  p  ■ 


^  i~  2k 


£  i- 


f  (n~1)Vvj  1 


)(Y5J:n  +  Y^ln) 


+  (ln2Cn?i))(Y*-'n  +  Yn-i:n)  +  •  •  •  +  (ln(^rf)7)  X 

(Y^n  +  YA^n)  +  (ln~ r^)(Y2:Jn  +  Y^1;n)+...} 


fii  =  ±  E  {  (lnl%lL))(YI):n+YAtn-Y11:Jn-YA^n) 

+  fYm:n+  Yn^.m+1;n—  Ym :  n  —  Yn^m+1;n^  X 

m=2  ,  „  ,  .  \ 

(ln (J-'H"-"))  l 

V  m(n-m+l)  )  j 

(3.2.3). 
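The step of combining like terms can be verified numerically for a single sample. The sketch below assumes the logistic inverse $F^{-1}(p) = \ln\frac{p}{1-p}$, with $F^{-1}(0)$ and $F^{-1}(1)$ replaced by $F^{-1}(\epsilon)$ and $F^{-1}(1-\epsilon)$. Under that assumption, each interior order statistic $Y_{m:n}$ carries the coefficient $\ln\frac{(m-1)(n-m)}{m(n-m+1)}$ and the two extreme order statistics carry $\ln\frac{\epsilon(n-1)}{1-\epsilon}$. The function names are illustrative, not the author's.

```python
import math
import random


def logit(p):
    return math.log(p / (1.0 - p))


def direct_sum(ys, eps):
    """Sum of F^{-1}(level) times the spacing between consecutive knots."""
    n = len(ys)
    knots = [eps] + sorted(ys) + [1.0 - eps]
    total = 0.0
    for r in range(n + 1):
        if r == 0:
            level = eps            # replaces F^{-1}(0)
        elif r == n:
            level = 1.0 - eps      # replaces F^{-1}(1)
        else:
            level = r / n
        total += logit(level) * (knots[r + 1] - knots[r])
    return total


def collected_sum(ys, eps):
    """Same quantity after collecting the coefficient of each order statistic."""
    n = len(ys)
    ordered = sorted(ys)
    end_coef = math.log(eps * (n - 1) / (1.0 - eps))
    total = logit(1.0 - eps) + end_coef * (ordered[0] + ordered[-1])
    for m in range(2, n):
        c_m = math.log(m * (n - m + 1) / ((m - 1) * (n - m)))
        total -= c_m * ordered[m - 1]
    return total
```

The constant `logit(1 - eps)` is common to the $(i,j)$ and $(j,i)$ samples, so it cancels when the two sums are differenced in the estimate of $\mu_i$.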

The properties of $\hat{\mu}_i$ will be explored in Chapter 4, which deals with the properties of the estimators that are linear combinations of order statistics.
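To make the limited integral approach concrete, here is a small simulation sketch. It assumes, for illustration only, that both $F$ and the mapping $\phi$ are standard logistic, that there is no order effect, and that responses are generated as $Y_{ij} = \phi(V + \delta_{ij})$ with $V$ logistic. All names (`simulate`, `estimate_mu`, `limited_integral`) are hypothetical helpers, not part of the original work.

```python
import math
import random


def logit(p):
    return math.log(p / (1.0 - p))


def expit(x):
    return 1.0 / (1.0 + math.exp(-x))


def limited_integral(sample, eps):
    """Integral over [eps, 1-eps] of F^{-1}(F_n(y)); F_n in {0, 1} is pulled to eps, 1-eps."""
    n = len(sample)
    clipped = sorted(min(max(y, eps), 1.0 - eps) for y in sample)
    knots = [eps] + clipped + [1.0 - eps]
    total = 0.0
    for r in range(n + 1):
        level = eps if r == 0 else (1.0 - eps if r == n else r / n)
        total += logit(level) * (knots[r + 1] - knots[r])
    return total


def simulate(mu, n, rng):
    """Draw n responses in (0, 1) for every ordered pair (i, j), i != j."""
    data = {}
    for i in range(len(mu)):
        for j in range(len(mu)):
            if i != j:
                delta = mu[i] - mu[j]
                draws = []
                for _ in range(n):
                    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
                    draws.append(expit(logit(u) + delta))  # Y = phi(V + delta)
                data[(i, j)] = draws
    return data


def estimate_mu(data, k, eps):
    """mu_hat_i = (1/2k) * sum over j != i of [integral f_ji - integral f_ij]."""
    estimates = []
    for i in range(k):
        total = 0.0
        for j in range(k):
            if j != i:
                total += limited_integral(data[(j, i)], eps)
                total -= limited_integral(data[(i, j)], eps)
        estimates.append(total / (2 * k))
    return estimates
```

With a few thousand observations per ordered pair and a small $\epsilon$, the estimates should land close to the generating values of $\mu_i$ (which must sum to zero), up to sampling noise and the small bias introduced by limiting the range of integration.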


3.3  Model with an Order Parameter


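In Chapter 2, the basic model was extended to include an order parameter, $\gamma_{ij}$, and the general estimate for this parameter was given in equation 2.3.5. We can make this estimate more specific using the logistic distribution, as we have done with $\hat{\mu}_i$ and $\hat{\phi}^{-1}(y)$. Here we will only consider the estimate as it appears using the limited integral approach of Section 3.2. Recall that in the general form

\[
\hat{\gamma}_{ij} = -\delta_{ij} + \frac{1}{2}\int_0^1 \Big( F^{-1}\big(F^{(n)}_{ji}(y)\big) - F^{-1}\big(F^{(n)}_{ij}(y)\big) \Big)\,dy
\]

and, as we observed in Section 3.1,

\[
\lim_{n \to \infty} \hat{\gamma}_{ij} = -\delta_{ij} + \frac{1}{2}\int_0^1 \Big\{ \big[\phi^{-1}(y) - (\delta_{ji} + \gamma_{ji})\big] - \big[\phi^{-1}(y) - (\delta_{ij} + \gamma_{ij})\big] \Big\}\,dy \tag{3.3.1}
\]

This equation gives the estimate for $\gamma_{ij}$ in 2.3.3, in which $\gamma_{ij}$ is determined for a specific $(i,j)$ pair.

Since $\phi^{-1}(y)$ is skew-symmetric at $y = \tfrac{1}{2}$, 3.3.1 becomes

\[
\lim_{n \to \infty} \hat{\gamma}_{ij} = -\delta_{ij} + \frac{1}{2}\big\{ -(\delta_{ji} + \gamma_{ji}) + (\delta_{ij} + \gamma_{ij}) \big\} = -\delta_{ij} + \delta_{ij} + \gamma_{ij} = \gamma_{ij}.
\]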


Using the development for $\hat{\mu}_i$ given in the discussion preceding 3.2.3, we have the following estimate for $\gamma_{ij}$ as it appears for the logistic distribution, where the range of integration is limited from $\epsilon$ to $1-\epsilon$:


?ij  =  hj  +  i  (  (ind^XYAUviln-V^-Y^n) 
+  (lnf^^XY>  =  "+Yi-l  =a-Yi=Jn-Ci  .„)+■ 


I  'V?Cln("~nl)(m~1)¥YJi  j-YJi  _Y* •?  _Yij  \ 

(n-m+l)m  A  m  • n  n-m+l:n  m.n  n-m+l:nj 


m=3 


(n-m+1 ) 


(3.3.2) 


for $0 < \epsilon < 1$.

The properties of $\hat{\gamma}_{ij}$ will be similar to those of $\hat{\mu}_i$, $i = 1, 2, \ldots, k$, and will be discussed in Chapter 4.


CHAPTER  4 


PROPERTIES  OF  THE  MINIMUM  MEAN  SQUARE  ERROR 
PARAMETER  ESTIMATES  INVOLVING  ORDER  STATISTICS 

4.1  Properties of $\hat{\mu}_i$ using the inverse logistic function.

In the previous chapter we presented estimates for $\mu_i$ under different sampling designs and different functional assumptions. In this chapter we consider the estimates for $\mu_i$, $i = 1, 2, \ldots, k$, using the limited integral approach. After developing a general form for the estimate, we apply it by taking $\phi(y)$ to be the logistic cdf, which makes the computation slightly simpler.

One approach to assessing the properties of the estimates given in 3.2.3 and 3.3.2 is to use the theory of L-statistics, that is, linear combinations of order statistics. This approach will be explored for the estimators $\hat{\mu}_i$ and $\hat{\gamma}_{ij}$, $1 \le i, j \le k$, in Section 4.3. There are several approaches to establishing the asymptotic theory associated with linear combinations of order statistics, such as those given by Shorack (1972), Chernoff, Gastwirth and Johns (1967), Boos (1979), or Stigler (1974). Shorack's approach places more stringent restrictions on the distribution function and less stringent requirements on the coefficients, usually referred to as weight functions in the literature. However, the approach of Shorack, along with those of Chernoff, Gastwirth and Johns and of Boos, is difficult to apply. We prefer the approach presented by Stigler (1974), who places more restrictions on the weight function.

Stigler  considers  the  statistics  Tn  and 
Sn  given  by  (in  Stigler’s  notation) 


1  =2 


and 


(4.1.1)  . 


in which $J(u)$ is a weight function defined on $[0,1]$ and $X_{i:n}$ is the $i$th ordered observation in a sample of size $n$. $T_n$ is the expression for the linear combination of the order statistics of equation 3.2.3, excluding the endpoints. Stigler proposes that $S_n$ and $T_n$ are asymptotically equivalent when the weight function $J(u)$ satisfies certain conditions. For example, $J(u)$ must be bounded and continuous almost everywhere with respect to $F^{-1}$. It is assumed that the variance of the parent distribution is finite. We give without proof the following theorems due to Stigler (1974), which provide us with the means to show that $S_n$ and therefore $T_n$ are asymptotically normal.

Theorem 1. Assume that $E|X_1|^2 < \infty$, and that $J(u)$ is bounded and continuous a.e. $F^{-1}$. Then

\[
\lim_{n \to \infty} n\sigma^2(T_n) = \sigma^2(J,F)
\]

and

\[
\lim_{n \to \infty} n\sigma^2(S_n) = \sigma^2(J,F),
\]

where

\[
\sigma^2(J,F) = 2 \iint_{-\infty < x < y < \infty} J(F(x))\,J(F(y))\,F(x)\,(1 - F(y))\,dx\,dy \tag{4.1.2}
\]

Theorem 2. Assume that $E(X^2) < \infty$, and that $J(u)$ is bounded and continuous a.e. $F^{-1}$. Then $\sigma^2(J,F) > 0$ implies

\[
\frac{S_n - E(S_n)}{\sigma(S_n)} \to N(0,1) \quad \text{as } n \to \infty,
\]

where $\sigma^2(J,F)$ is given by 4.1.2.

Theorem 3. Assume that $E|X_1| < \infty$, and that $J(u)$ is bounded and continuous a.e. with respect to Lebesgue measure. Then as $n \to \infty$, $E(S_n) \to \mu(J,F)$, where

\[
\mu(J,F) = \int_0^{\infty} \int_{F(x)}^{1} J(u)\,du\,dx - \int_{-\infty}^{0} \int_0^{F(x)} J(u)\,du\,dx = \int_0^1 J(u)\,F^{-1}(u)\,du \tag{4.1.3}
\]

Theorem 4. Assume that $\int [F(x)(1 - F(x))]^{1/2}\,dx < \infty$ and that $J(u)$ is bounded and satisfies a Hölder condition with $\alpha > \tfrac{1}{2}$ (except possibly at a finite number of points of $F^{-1}$ measure zero). Then

\[
\sqrt{n}\,\big(E(S_n) - \mu(J,F)\big) \to 0,
\]

where $\mu(J,F)$ is given by 4.1.3.

Theorem 5. Assume that for some $\epsilon > 0$, $\lim_{x \to \infty} x^{\epsilon}[1 - F(x) + F(-x)] = 0$, and that $J(u)$ is bounded and continuous a.e. $F^{-1}$. If in addition $J(u) = 0$ for $0 < u < \alpha$ and $1 - \alpha < u < 1$, then $n\sigma^2(S_n) \to \sigma^2(J,F)$ (given by 4.1.2), and if $\sigma^2(J,F) > 0$,

\[
\frac{S_n - E(S_n)}{\sigma(S_n)} \to N(0,1) \quad \text{as } n \to \infty. \tag{4.1.4}
\]

Furthermore, the assumptions that $E|X_1| < \infty$ and that $\int [F(x)(1 - F(x))]^{1/2}\,dx < \infty$ can be dropped from Theorems 3 and 4 and the conclusions will still hold.

In order to show the asymptotic normality of $T_n$ using Theorem 5, the underlying cumulative distribution function (cdf), $F(x)$, as well as the weight function $J(u)$, must satisfy the following conditions:

i) $J(u) = 0$ for $0 < u < \alpha$ and $1 - \alpha < u < 1$

ii) $\lim_{x \to \infty} x^{\epsilon}[1 - F(x) + F(-x)] = 0$, for some $\epsilon > 0$

iii) $J(u)$ is bounded and continuous a.e.

To determine which, if any, of the above conditions are satisfied, we first need to find $J(u)$. We proceed to develop the mean and variance terms in general, given that $J(u)$ does satisfy the conditions. Recalling the basic equations for $\hat{\mu}_i$ in 3.2.3, we observe that there are really two linear combinations of order statistics to consider, one involving $Y^{ij}$ and the second involving $Y^{ji}$. This is true for any distribution and mapping function satisfying the definitions provided in Chapter 2. This gives then,

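\[
\hat{\mu}_i = \frac{1}{2k} \sum_{j(\neq i)} \{S_{ji} - S_{ij}\}
\]

where

\[
S_{ij} = \sum_{r=1}^{n} F^{-1}(\tfrac{r}{n})\,\{Y^{ij}_{r+1:n} - Y^{ij}_{r:n}\} = \sum_{r=1}^{n} Y^{ij}_{r:n}\,\big\{F^{-1}(\tfrac{r-1}{n}) - F^{-1}(\tfrac{r}{n})\big\} \tag{4.1.5}
\]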


This enables us to find a general form for $J(u)$ as follows:

\[
J(u) = \lim_{\Delta u \to 0} \frac{F^{-1}(u + \Delta u) - F^{-1}(u)}{\Delta u} = \frac{d}{du}F^{-1}(u) \tag{4.1.6}
\]

for $u = \tfrac{r}{n}$ and $\Delta u = \tfrac{1}{n}$.


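Using 4.1.6, we can determine the general form for $\mu(J,F_{ij})$ as shown:

\[
\mu(J,F_{ij}) = \int_0^1 J(u)\,F_{ij}^{-1}(u)\,du = \int_0^1 \frac{d}{du}F^{-1}(u)\,F_{ij}^{-1}(u)\,du.
\]

Recalling equation 2.1.1, we have then

\[
F_{ij}(u) = F(\phi^{-1}(u) - \delta_{ij}) = z, \qquad \phi^{-1}(u) - \delta_{ij} = F^{-1}(z), \qquad \phi\big(F^{-1}(z) + \delta_{ij}\big) = u.
\]

Therefore,

\[
F_{ij}^{-1}(u) = \phi\big(F^{-1}(u) + \delta_{ij}\big).
\]

By letting $G(u) = F^{-1}(u) = x$, then

\[
dx = G'(u)\,du = \frac{d}{du}F^{-1}(u)\,du.
\]

This results in

\[
\mu(J,F_{ij}) = \int_0^1 G'(u)\,F_{ij}^{-1}(u)\,du = \int_0^1 G'(u)\,\phi\big(G(u) + \delta_{ij}\big)\,du = \int_{-\infty}^{\infty} \phi(x + \delta_{ij})\,dx,
\]

and incorporating the limited integral approach gives

\[
\mu(J,F_{ij}) = \int_{-\alpha + \delta_{ij}}^{\alpha + \delta_{ij}} \phi(x)\,dx. \tag{4.1.7}
\]

To center the function around zero instead of $\tfrac{1}{2}$ we let $\psi(z) = \phi(z) - \tfrac{1}{2}$. Then equation 4.1.7 becomes

\[
\begin{aligned}
\mu(J,F_{ij}) &= \int_{-\alpha + \delta_{ij}}^{\alpha + \delta_{ij}} \big(\psi(x) + \tfrac{1}{2}\big)\,dx \\
&= \int_{-\alpha + \delta_{ij}}^{\alpha + \delta_{ij}} \psi(x)\,dx + \tfrac{1}{2}\big(\alpha + \delta_{ij} - (-\alpha + \delta_{ij})\big) \\
&= \alpha + \int_{-\alpha + \delta_{ij}}^{\alpha + \delta_{ij}} \psi(x)\,dx.
\end{aligned}
\]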


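By observing the symmetric properties that exist for $\psi(x)$ we rewrite the equation as follows:

\[
\mu(J,F_{ij}) = \alpha + \int_{-\alpha + \delta_{ij}}^{\alpha - \delta_{ij}} \psi(x)\,dx + \int_{\alpha - \delta_{ij}}^{\alpha + \delta_{ij}} \psi(x)\,dx,
\]

which reduces to

\[
\mu(J,F_{ij}) = \alpha + \int_{\alpha - \delta_{ij}}^{\alpha + \delta_{ij}} \big(\phi(x) - \tfrac{1}{2}\big)\,dx = \alpha - \delta_{ij} + \int_{\alpha - \delta_{ij}}^{\alpha + \delta_{ij}} \phi(x)\,dx. \tag{4.1.8}
\]

Similarly, we can determine the mean for the $Y^{ji}$ portion of the equation, giving

\[
\mu(J,F_{ji}) = \alpha + \delta_{ij} - \int_{\alpha + \delta_{ji}}^{\alpha - \delta_{ji}} \phi(x)\,dx. \tag{4.1.9}
\]

To get the overall mean we combine the two, which results in

\[
\mu(J,F) = 2\delta_{ij} - \int_{\alpha + \delta_{ji}}^{\alpha - \delta_{ji}} \phi(x)\,dx - \int_{\alpha - \delta_{ij}}^{\alpha + \delta_{ij}} \phi(x)\,dx = 2\delta_{ij} + B(\alpha, \delta_{ij}), \tag{4.1.10}
\]

where $B(\alpha, \delta_{ij})$ is the bias term given by the integral portion of the equation for $\mu(J,F)$.

The bias in 4.1.10 is small when $\alpha$ is large. In this case the value of $\phi(x)$ is close to 1, which results in the mean reducing to the following:

\[
\mu(J,F) = -2\delta_{ij}. \tag{4.1.11}
\]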
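The rewriting of the mean in 4.1.7 through 4.1.11 can be checked numerically. The sketch below assumes, as in Section 4.2, that $\phi$ is the standard logistic cdf, whose antiderivative is $\ln(1 + e^{x})$; the function names are illustrative only.

```python
import math


def softplus(x):
    """Antiderivative of the logistic cdf: ln(1 + e^x)."""
    return math.log1p(math.exp(x))


def integral_phi(a, b):
    """Integral of the logistic cdf over [a, b]."""
    return softplus(b) - softplus(a)


def mean_ij(alpha, delta):
    """Mean as in (4.1.7): integral of phi over (-alpha + delta, alpha + delta)."""
    return integral_phi(-alpha + delta, alpha + delta)


def mean_ij_rewritten(alpha, delta):
    """Mean as in (4.1.8): alpha - delta + integral over (alpha - delta, alpha + delta)."""
    return alpha - delta + integral_phi(alpha - delta, alpha + delta)


def bias(alpha, delta):
    """Integral portion B(alpha, delta) of the overall mean in (4.1.10)."""
    return -2.0 * integral_phi(alpha - delta, alpha + delta)


def overall_mean(alpha, delta):
    """Overall mean 2*delta + B(alpha, delta)."""
    return 2.0 * delta + bias(alpha, delta)
```

For a large truncation point such as $\alpha = 10$, `overall_mean(alpha, delta)` should be very close to $-2\delta_{ij}$, in line with 4.1.11.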

Using  equation  4.1.5  and  the  fact  that  as  n— >oo 
E (Sn)  — ( J , F)  we  have 

k 

^i=  -2k  £  ^(J>F) 

J  i) 

k 

=  ^  E  -  2*ij  +  B(“^i)  (4.1.12) 

j  oo 

k 

where  8(0,^)  =  ^  E  B(“»*ij)- 

j)^i) 

Finally  then  we  have. 




1  im  u  •  »  u . 

n— *oo  i  ^1 

when  a  is  sufficiently  large. 

We  have  then,  considering  that  there 
will  always  be  at  least  some  small  amount  of  bias 
present 

ED*i]  =  +  B*  (or ,  6  J  j)  (4.1.13) 

where  B*(a,6.  ■)  =  u  •  —  1  im  u  ■  for  all  a. 

v  i  J  '  i  n— »oo  l 

The equation given for the variance term (equation 4.1.2) involves so many variable terms that we were not able to give a precise form for the variance in the general case. Instead, we can bound the variance, which allows us to make the variance term as small as we want simply by increasing the sample size. We can modify 4.1.2 as follows:


«r2(J,Fij)=  I  I  J(u)  u  (  1  -v)  dF  £  j  (u)  J  (v)  d  F  J  j  ( v) 

C<U<  V<1  — f 

<  /  /  J(u)  dFTj.(u)  J(v)  d  F  j  j  (v) 

6<U<  V<l-6 
1-6  1—6 

<5  /  /  J(u){^(F-1(z)+5ij}d  F-^z)  dz. 

e  ( 




The  weight  function,  J(u),  will  be  finite  as  long 
as  the  density  f(u),  corresponding  to  F(u),  is  not 
zero  over  (-a, a).  Since  we  know  that  if  G(u)=z 
then  g(u)  du=dz  which  implies 


du  =  -4^4  =  — 4s — • 

g(u)  g(F~l(z)) 


If  £  is  bounded  then  we  know  x  is  bounded  so 
dF-1(z)  is  bounded  and  we  can  conclude  that  the 
variance  terms  is  finite.  From  Theorem  1  then,  we 
have  that 


then 


n<r2(Sn)  —  <r2(J  ,F) 


*2(Sn) 


<r2(J,F) 

n 


where  <t2(J,F)  =  ( 


=  (A)1  £  {'’(J.Flj)  + 


4.2  Application Using a Logistic Mapping Function

Using the general form for $\mu(J,F)$ and the bounded variance, we can consider more specific mean and variance terms using a logistic mapping distribution. The common coefficient for the terms of equation 3.2.3 is

\[
c_m = \ln \frac{m(n-m+1)}{(m-1)(n-m)}. \tag{4.2.1}
\]

By letting $m = nx$ we have

\[
c_m = \ln \Big\{ \frac{nx(n - nx + 1)}{(nx - 1)(n - nx)} \Big\}.
\]

From 4.1.1, we see that in order to obtain $J(x)$, we must take $n$ times the coefficient, resulting in

\[
\begin{aligned}
n \ln \frac{nx(n - nx + 1)}{(nx - 1)(n - nx)} &= n \ln \frac{x(n - nx + 1)}{(nx - 1)(1 - x)} \\
&= n \Big[ \ln \frac{x}{1-x} + \ln \frac{n - nx + 1}{nx - 1} \Big] \\
&= n \Big[ \ln \frac{x}{1-x} - \ln \frac{x - \tfrac{1}{n}}{1 - (x - \tfrac{1}{n})} \Big].
\end{aligned}
\]

Now let $f(x) = \ln \dfrac{x}{1-x}$ and $h = \tfrac{1}{n}$; then

\[
n \Big[ \ln \frac{x}{1-x} - \ln \frac{x - \tfrac{1}{n}}{1 - (x - \tfrac{1}{n})} \Big] = \frac{f(x) - f(x-h)}{h} \to f'(x) \quad \text{as } h \to 0.
\]

Consider

\[
f'(x) = \frac{d}{dx} \ln \frac{x}{1-x} = \frac{1-x}{x} \Big[ \frac{1}{1-x} + \frac{x}{(1-x)^2} \Big] = \frac{1}{x} + \frac{1}{1-x}. \tag{4.2.2}
\]

Hence

\[
J(x) = \frac{1}{x(1-x)}.
\]

Therefore,

\[
J(u) = \begin{cases} \dfrac{1}{u(1-u)} & \text{for } u \in (\epsilon, 1-\epsilon) \\ 0 & \text{for } u \in [0, \epsilon],\ [1-\epsilon, 1] \end{cases}
\]

is the weight function for estimating $\mu_i$, $i = 1, 2, \ldots, k$.

With this weight function, $J(u)$, we systematically proceed to determine if the remaining three conditions concerning $F(x)$ and $J(u)$ are satisfied.

i) Since the weight function ranges only from $m = 2$ to $n - 1$, $J(u)$ is defined to be zero for either of the endpoints as required by this condition.

ii) For this condition we look at the tail areas of the distribution of the random variable $Y$ for which $0 < Y < 1$. The tail area condition for $Y$ can be expressed as

\[
\lim_{y \to 1} y^{\epsilon}\,[1 - F_{ij}(y) + F_{ij}(1-y)] = 0
\]

for which

\[
F_{ij}(y) = F\big(\phi^{-1}(y) - (\mu_i - \mu_j)\big),
\]

where $F$ is a central logistic distribution. Therefore, for fixed $\delta_{ij}$,

\[
\lim_{y \to 1} y^{\epsilon} \Big\{ 1 - \frac{1}{1 + e^{-(\ln\frac{y}{1-y} - \delta_{ij})}} + \frac{1}{1 + e^{-(\ln\frac{1-y}{y} - \delta_{ij})}} \Big\} = 1\,[1 - 1 + 0] = 0,
\]

and this condition is met.

iii) In finding the original weight function, it was shown that the derivative of $F(x)$, given by $F(x)(1 - F(x))$, was bounded by 1, and hence $J(u)$ is also bounded and continuous a.e., satisfying this final condition.
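The passage from the discrete coefficient $c_m$ in 4.2.1 to the weight function $J(x) = \frac{1}{x(1-x)}$ can be illustrated numerically: with $m = nx$, the product $n\,c_m$ should approach $J(x)$ as $n$ grows. The following sketch uses function names chosen here for illustration.

```python
import math


def c_m(m, n):
    """Coefficient ln[m(n-m+1) / ((m-1)(n-m))], valid for 2 <= m <= n-1."""
    # Written as two log1p terms for numerical stability when n is large.
    return math.log1p(1.0 / (m - 1)) + math.log1p(1.0 / (n - m))


def weight(u):
    """Limiting weight function J(u) = 1 / (u (1 - u))."""
    return 1.0 / (u * (1.0 - u))


def scaled_coefficient(x, n):
    """n * c_m evaluated at m = round(n x)."""
    m = round(n * x)
    return n * c_m(m, n)
```

For example, `scaled_coefficient(0.3, 10**6)` and `weight(0.3)` should agree to several decimal places, with the discrepancy shrinking roughly like $1/n$.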

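Knowing that the conditions for Stigler's results hold, we now proceed to find the asymptotic mean, $\mu(J,F)$, and variance, $\sigma^2(J,F)$. Since there are two types of linear combinations of order statistics in the equation for $\hat{\mu}_i$, one for $Y^{ij}$ and one for $Y^{ji}$, we compute their means separately, then combine them since the observations are independent. After appropriately substituting into equation 4.1.4 we find:

\[
\begin{aligned}
\mu(J,F_{ij}) &= \int_{\epsilon}^{1-\epsilon} J(u)\,F_{ij}^{-1}(u)\,du \\
&= \int_{\epsilon}^{1-\epsilon} \frac{1}{u(1-u)} \cdot \frac{u}{u + (1-u)\,e^{-\delta_{ij}}}\,du \\
&= \int_{\epsilon}^{1-\epsilon} \Big[ \frac{1}{1-u} + \frac{1 - e^{-\delta_{ij}}}{e^{-\delta_{ij}} + u\,(1 - e^{-\delta_{ij}})} \Big]\,du \\
&= \ln \Big( \frac{u + (1-u)\,e^{-\delta_{ij}}}{1-u} \Big) \Big|_{\epsilon}^{1-\epsilon}.
\end{aligned}
\]

Similarly, we find

\[
\mu(J,F_{ji}) = \ln \Big( \frac{u + (1-u)\,e^{-\delta_{ji}}}{1-u} \Big) \Big|_{\epsilon}^{1-\epsilon}.
\]

Combining these gives

\[
\begin{aligned}
\mu(J,F) &= \mu(J,F_{ji}) - \mu(J,F_{ij}) \\
&= \ln \Big( \frac{u + (1-u)\,e^{-\delta_{ji}}}{u + (1-u)\,e^{-\delta_{ij}}} \Big) \Big|_{\epsilon}^{1-\epsilon} \\
&= \ln \Big[ \frac{\big(1 - \epsilon + \epsilon\,e^{-\delta_{ji}}\big)\big(\epsilon + (1-\epsilon)\,e^{-\delta_{ij}}\big)}{\big(1 - \epsilon + \epsilon\,e^{-\delta_{ij}}\big)\big(\epsilon + (1-\epsilon)\,e^{-\delta_{ji}}\big)} \Big] \\
&= \ln \Big[ \frac{2(1-\epsilon)\epsilon + (1-\epsilon)^2 e^{-\delta_{ij}} + \epsilon^2 e^{-\delta_{ji}}}{2(1-\epsilon)\epsilon + \epsilon^2 e^{-\delta_{ij}} + (1-\epsilon)^2 e^{-\delta_{ji}}} \Big].
\end{aligned}
\]

By taking the limit as $\epsilon \to 0$, we have

\[
\lim_{\epsilon \to 0} \big(\mu(J,F_{ji}) - \mu(J,F_{ij})\big) = \ln \Big( \frac{e^{-\delta_{ij}}}{e^{-\delta_{ji}}} \Big) = \ln \big( e^{-\delta_{ij} + \delta_{ji}} \big) = -(2\mu_i - 2\mu_j). \tag{4.2.3}
\]

Summing over all $j$, with $\mu_i = -\sum_{j(\neq i)} \mu_j$, gives:

\[
-\frac{1}{2k} \sum_{j(\neq i)} 2(\mu_i - \mu_j) = -\frac{1}{2k}\,(2k\mu_i + 0).
\]

This leads us to

\[
\lim_{n \to \infty} E(\hat{\mu}_i) = -\mu_i. \tag{4.2.4}
\]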

The minus sign is correct because we need to rewrite the original equation for $\hat{\mu}_i$ with a negative sign factored out of the summation, as shown in Section 4.1 with $S_n$. This is a finite analogue of the integral given in 4.1.4, so that

\[
E(\hat{\mu}_i) = -\frac{1}{2k} \sum_{j(\neq i)} \mu(J,F),
\]

which does give us $\lim_{n \to \infty} E(\hat{\mu}_i) = \mu_i$.
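The closed form for $\mu(J,F_{ij})$ and the limit in 4.2.3 lend themselves to a direct numerical check. The sketch below assumes the logistic weight function $J(u) = \frac{1}{u(1-u)}$ and $F_{ij}^{-1}(u) = \frac{u}{u + (1-u)e^{-\delta_{ij}}}$, and compares the antiderivative with a composite Simpson rule; the function names and the choice of quadrature are illustrative assumptions.

```python
import math


def antiderivative(u, delta):
    """ln[(u + (1-u) e^{-delta}) / (1-u)], antiderivative of J(u) F_ij^{-1}(u)."""
    a = math.exp(-delta)
    return math.log((u + (1.0 - u) * a) / (1.0 - u))


def mean_component(delta, eps):
    """mu(J, F_ij) over the limited range (eps, 1 - eps)."""
    return antiderivative(1.0 - eps, delta) - antiderivative(eps, delta)


def combined_mean(delta_ij, eps):
    """mu(J, F_ji) - mu(J, F_ij), using delta_ji = -delta_ij."""
    return mean_component(-delta_ij, eps) - mean_component(delta_ij, eps)


def integrand(u, delta):
    """J(u) * F_ij^{-1}(u) = 1 / [(1-u)(u + (1-u) e^{-delta})]."""
    return 1.0 / ((1.0 - u) * (u + (1.0 - u) * math.exp(-delta)))


def simpson(f, a, b, n=20000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(a + k * h)
    return total * h / 3.0
```

As $\epsilon$ shrinks, `combined_mean(delta_ij, eps)` should approach $-2\delta_{ij}$, which is the limit stated in 4.2.3.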

To obtain the variance term, we return to equation 4.1.2, which, with the appropriate substitutions, is given as follows:

\[
\sigma^2(J,F) = 2 \iint_{0 < u < v < 1} J(u)\,J(v)\,u\,(1-v)\,dF^{-1}(u)\,dF^{-1}(v)
\]

with $u = F(x)$ and $v = F(y)$. We evaluate the integral as shown below:

\[
\sigma^2(J,F) = 2 \iint_{\epsilon < u < v < 1-\epsilon} \frac{1}{u(1-u)} \cdot \frac{1}{v(1-v)}\,u\,(1-v)\,\frac{1}{u(1-u)} \cdot \frac{1}{v(1-v)}\,du\,dv
\]


-*/  / 

«<U<  V<l-€ 


v2 ( 1 - v) u ( 1 -u) 


;du  dv 


=2  /  ^)(  /  du)  dv 
=2  7  lnl^  I)  dv 


(4.2.5) 


Expanding  and  combining  like  terms  then  results  in 


i  -e 

/  2~7"1 - x  1  n— -dv 

l  v2(l-v)  1-v 


nt  ±  — t 

^?dv  +  / 

dv)} 


(4.2.6) 

By  using  the  appropriate  integral  tables  4.2.6 
becomes 




<r2  = 


1  -2v  o 
v(l-v) 


lnl^ 


1-e 

! 

£ 


V  -  1  n 


1  —  V 
V 


1  —  £ 
I 

f 


1  —  £ 
£ 

♦/ 

1  — e 


(In 


x) 


(1+x)2 


dx 


where  the  last  integral  expression  arises  when  we 


substitute  x  for 


After  algebraic 


1  —  v  • 

simplification,  this  last  integral  can  be 
separated  into  parts  for  integration  as  follows: 

1  —  € 


1  —  £ 
£ 


1  —  € 
£ 


J  (In  x)  dx  =  J  -^(ln  x) dx  +  J  |(ln  x)dx 


f 

1  —  € 


£ 

1  —  £ 


£ 

1  —  € 


1-f 


1  —  £ 


=  k  (-lnx-l  |  +  j((ln  x)2)  |  . 


€ 

1-e 


e 

1-e 


(4.2.7) 


Therefore 

<r2(J,F)  = 


2(1-Q-1 

(l-£)£ 


-  2  ln=-*-  + 


l-2£ 


l-£  T  (l-£)£ 


+  2  In  ^  +  (in  ^-T^X-rb-ln^  +  l  +ln^) 


+  *  +  V  i"  A  +  } 




After  combining  terms  we  arrive  at  the  expression 
for  the  variance  term: 


F)=  2jsiZiig  +  ftln*  +  2(lnlf»)>|. 

(4.2.8) 

Recalling  that  Hnri  n/r2  (Tn)  =  ff'2  ( «J  » F)  gives  us  the 

asymptotic  variance  of  Tn  as 

-!(T„)=!  |  +  £ln  ^  +  2(lnlfl)’} 

(4.2.9) 

This  is  reasonable  since  the  variance  depends  on 
the  range  of  the  sample  (as  determined  by  e)  and, 
of  course,  on  the  sample  size. 


Finally, then, we can conclude that

\[
\frac{\hat{\mu}_i - E(\hat{\mu}_i)}{\sigma(\hat{\mu}_i)} \to N(0,1) \quad \text{as } n \to \infty, \tag{4.2.13}
\]

where $E(\hat{\mu}_i)$ is given in 4.1.10.


4.3  Properties 

of  Using 

Log i st  i  c 

I  nve  rse 

When 

an  order 

effect 

exists , 

the 

estimator  for 

the  order 

parameter 

’  7i  j’ 

also 

involves  order 

stat i st i cs 

as  shown 

in  equation 

3.3.2.  The  coefficients  of  all  of  "the  "terms  from 
m=2  "to  m=n-l  inclusive,  are  identical  to  those  of 
fi  ^  as  discussed  in  Section  4.2.  Therefore,  we  can 
draw  similar  conclusions  to  those  found  in  Section 

4.2.  Again,  the  weight  function  J ( u )  =  -  .  * ^  and 

u(l-u) 

the mean and variance for $T_n$, the L-statistic, will be those given in 4.1.4 and 4.1.2. An added complexity arises, however, when considering $\gamma_{ij}$, since $\gamma_{ij}$ is a function of the ordered pairs (i,j) and (j,i) only.

Using the discussion in Section 4.2 on the conditions for Stigler's asymptotic distribution of L-statistics, we can proceed with the asymptotic distribution of $\hat{\gamma}_{ij}$. Using the development preceding equation 4.1.5, we rewrite equation 3.3.2 as


(4.3.1) 




where  S^j  is  defined  in  equation  4.1.5.  From 
4.1.10,  we  found  that  prior  to  summing  over  all 
p(J,F)=  26^  + 

Recall  that  fjjCy)  =  F-1(Fj^(y) 
and,  as  n  —  oo  ,  fjj(y)->  F'1(F(?6-1(y)  -  (<5j  j+7j  j)  )  )  ; 


=  <t>  1  (y)  —  (6 j  i+'l'ij) 

and 

f  ij(y)=F-1(F[j)(y))  -  F_1  (FO"1  (y)  ~  i  j  +  7i  j)  )  )  • 


=  <^~1(y)-(5ij+7ij)  • 


By  proceeding  similarly  to  the  development  for 
in  Section  4.1,  we  obtain 


o-CSji+lji) 

KJ  .Fj  j)  =a-(<i  j  +  7ij)  +  j  ^(x)dx 

—‘j  i+f  j  i 


o+iij+Tij 


i)=a+(«ij+7jj)-  j  *(x)dx. 


o-(«ij+7ij) 


Combining  these  results  gives 




1  i  m 

n  — ►  oq 


i  j 


:-*ij  + 


7  .  .+ 
71J 


I  2(Sji+7ij)  -  /  *00 

“+<  j i+1j i 


dx 


1  J 


a+«ij+7iJ 


-(‘ij+Tij) 


*00 


(4.3.3) 


for  large  a,  as  discussed  in  section  4.1. 

Therefore, since there is always some amount of bias, $\hat{\gamma}_{ij}$ is a biased estimator of $\gamma_{ij}$ and

$$\lim_{n\to\infty} E(\hat{\gamma}_{ij}) = \gamma_{ij} + B(a,\delta_{ij},\gamma_{ij}) \qquad (4.3.4)$$

where

B(«,#ij,rij)=  Tij-nm. 

*ij- 

From Section 4.1, we know that the variance term is bounded for the L-statistics we are investigating. The asymptotic variance for the linear combination of order statistics, $\sigma^2(T_n)$, is given in equation 4.2.9. This leads to the variance of $\hat{\gamma}_{ij}$ as follows:


(7  i  j)  =  <r2(  Tn) 


_  2  f 6e2  -  7e  +  2  .  4e  |n  (  .  1  1  -e vl 

"  n  \  7(1-0 2  “  ^  2(ln— )) 


(4.3.4) . 


Recall that the bias term becomes small for large a, so that we can make the variance as small as necessary. We conclude then that $\hat{\gamma}_{ij}$ is a consistent estimator of $\gamma_{ij}$.

Since we have presented an analysis for ordered pairs, the estimate of order effects is also included, since order parameters are frequently used in the study of paired comparison experiments.


CHAPTER  5 


SIMULATION  STUDY  USING  PARAMETER  ESTIMATES 


5.1  General  Approach 

This  chapter  uses  simulated  data  to 
empirically  explore  the  parameter  estimators  and 
determine  the  surface  validity  of  the  resulting 
estimates.  A  random  number  generator  provided 
values  of  a  uniform  random  variable  x,  defined 
over  the  interval  (0,1),  which  were  then 
transformed  to  values  of  a  logistic  random 
variable  V,  using  the  transformation: 


$$V = \ln\left(\frac{x}{1-x}\right) \qquad (5.1.1)$$
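The transformation in 5.1.1 is the inverse of the standard logistic cdf, so it can be sketched in a few lines. The following is only an illustrative sketch: the function names `logistic_from_uniform` and `simulate_logistic`, the seed, and the use of Python's built-in generator are assumptions, not the generator used in the original study.

```python
import math
import random


def logistic_from_uniform(x):
    """Apply transformation 5.1.1: V = ln(x / (1 - x)) for x in (0, 1)."""
    if not 0.0 < x < 1.0:
        raise ValueError("x must lie strictly between 0 and 1")
    return math.log(x / (1.0 - x))


def simulate_logistic(n, seed=0):
    """Draw n standard logistic values from uniform random numbers."""
    rng = random.Random(seed)  # hypothetical seed, chosen for repeatability
    draws = []
    while len(draws) < n:
        x = rng.random()  # uniform on [0, 1); 0.0 is skipped below
        if x > 0.0:
            draws.append(logistic_from_uniform(x))
    return draws
```

Applying the logistic cdf $1/(1+e^{-v})$ to a transformed value returns the original uniform value, which is a quick way to check the transformation.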


The logistic random variables are the $V_{ij}$'s discussed in Chapter 2 with $V_{ij} \sim L(0,1)$ with the range $(-\infty,+\infty)$.

The logistic data were generated for paired comparisons with 3, 4, or 5 items to be compared. Replications of 5, 10, and 25 at each of the item combinations were considered. Ordered sampling of both (i,j) and (j,i) was performed as indicated by the results in Chapter 3. Each parameter estimate was then programmed separately, using the appropriate equation given in Chapter 4.


5.2 Results for $\hat{\mu}_i$

In order to estimate $\mu_i$ with simulated data, we need to set $\delta_{ij}$ at some predetermined value based on the $\mu_i$'s. By presetting the $\mu_i$'s we have a basis for comparing the estimates with the original predetermined $\mu_i$'s. The $\mu_i$'s were selected so that the effect of different sample sizes on the estimate could be revealed with only one $\mu_i$ different (see Table 5.1), then with 2, 3, and 4 $\mu_i$'s being different (see Tables 5.2 and 5.3). Sorting of the variables was necessary because of the order statistics contained in the estimate (3.1.8).


After summing the linear combinations of order statistics over all $j \ne i$ we can determine how close our estimate is to the preset $\mu_i$'s. Recall that the parameter can only be estimated up to a scale factor, so we do not expect the estimates to be identical to the preestablished values but to be in similar proportion to them.
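One way to make "similar proportion" concrete is to fit a single scale factor between the preset and estimated values and look at what is left over. The sketch below assumes a least-squares scale factor; this criterion and the names `best_scale` and `proportion_errors` are illustrative choices, not part of the original analysis.

```python
def best_scale(preset, estimated):
    """Least-squares c minimising sum((estimated - c * preset)^2)."""
    numerator = sum(p * e for p, e in zip(preset, estimated))
    denominator = sum(p * p for p in preset)
    return numerator / denominator


def proportion_errors(preset, estimated):
    """Return the fitted scale factor and the residuals after rescaling."""
    c = best_scale(preset, estimated)
    residuals = [e - c * p for p, e in zip(preset, estimated)]
    return c, residuals


# Values from Table 5.1 at 25 replications.
scale, residuals = proportion_errors([-2.0, 1.0, 1.0], [-1.129, 0.374, 0.370])
```

Small residuals relative to the rescaled preset values indicate that the estimates preserve the preset proportions.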

Tables 5.1 to 5.3 reflect the results of the simulation for $\hat{\mu}_i$. As expected, the estimates are closer in proportion to the true preset values of $\mu_i$ as the sample size increases. The estimated values obtained from identical preset $\mu_i$'s are quite similar in most of the examples while the relative proportions for the


Table 5.1

Comparison of 3 Items

Replications   Parameter   Preset Value   Estimated Value
     5           μ_1            1              0.354
                 μ_2            1              0.658
                 μ_3           -2             -1.620
    10           μ_1            1              0.290
                 μ_2            1              0.378
                 μ_3           -2             -1.129
    25           μ_1           -2             -1.129
                 μ_2            1              0.374
                 μ_3            1              0.370


Table 5.2

Comparison of 4 Items

Replications   Parameter   Preset Value   Estimated Value
     5           μ_1            1              0.115
                 μ_2            1              0.513
                 μ_3            1              0.209
                 μ_4           -3             -1.620
    10           μ_1            1              0.387
                 μ_2            1              0.300
                 μ_3            1              0.087
                 μ_4           -3             -1.340
    25           μ_1            1              0.231
                 μ_2            1              0.206
                 μ_3            1              0.302
                 μ_4           -3             -1.080

Table 5.3

Comparison of 5 Items

Replications   Parameter   Preset Value   Estimated Value
     5           μ_1            1              0.184
                 μ_2            1              0.295
                 μ_3           -2             -0.368
                 μ_4           -1             -0.195
                 μ_5            1              0.169
    10           μ_1            1              0.180
                 μ_2            1              0.200
                 μ_3           -2             -0.604
                 μ_4           -1             -0.190
                 μ_5            1              0.239
    25           μ_1            1              0.192
                 μ_2            1              0.193
                 μ_3           -2             -0.484
                 μ_4           -1             -0.197
                 μ_5            1              0.218

Table 5.4

Comparison of 5 Items (Different preset μ's)

Replications   Parameter   Preset Value   Estimated Value
    25           μ_1           -2             -0.489
                 μ_2            1              0.193
                 μ_3            1              0.201
                 μ_4            1              0.199
                 μ_5           -1             -0.187
    25           μ_1           -2              0.353
                 μ_2            3             -0.594
                 μ_3            1             -0.162
                 μ_4           -1              0.165
                 μ_5           -1              0.160
    25           μ_1            2             -0.338
                 μ_2            2             -0.300
                 μ_3           -4              1.000
                 μ_4            1             -0.119
                 μ_5           -1              0.154

$\mu_i$'s set at different levels are not quite as accurate. With an increase in sample size these tend closer to the correct proportions indicated by the preset values.


5.3 Results on $\phi^{-1}$ and Density estimation
procedures  for  estimating  ^  3-1(u)  (optional) 

The estimate developed for $\phi^{-1}$ as shown in equation 2.2.1 was programmed using the basic logistic distribution for the sample data.
Since  this  function  provides  a  one-to-one  mapping 
from  (0,1)  onto  the  real  numbers,  we  would  expect 
the  sample  data  to  reveal  a  logistic  cdf  when 
graphed.  Since  <£(v)=- — then  ^-1(p)  is  the 
logistic  cdf  and  ^(v)  is  the  logistic  density.  As 
is typical for most estimates, the larger the sample size, the smoother the curve. Each of the graphs in Figures 5.1 to 5.3 was plotted based on the data generated under the simulation discussed in Section 5.1. Since these graphs are quite close to a true logistic cdf, it would appear that the estimate developed through this research is adequate for moderate sample sizes.

Following the consideration of $\phi^{-1}$, we further explored the data through density estimation procedures. The kernel estimator (Silverman, 1986) was used with a normal kernel to graph $\phi'(v)$ obtained from the simulated data. This


Figure 5.1: Phi Inverse with 3 Treatments and 25 Replications

Figure 5.2: Phi Inverse with 4 Treatments and 25 Replications


estimator is defined by Silverman to be

$$\hat{f}(x) = \frac{1}{nh} \sum_{i=1}^{n} K\left(\frac{x - X_i}{h}\right) \qquad (5.3.1)$$

where h is the window width, bandwidth, or smoothing parameter. Various bandwidths were considered as suggested by Silverman. Figures 5.4 to 5.9 show the simulated data using 4 treatments and 10 replications; Figures 5.10 to 5.16 also use 4 treatments but with 25 replications; and finally, Figures 5.17 to 5.22 show 4 treatments with 50 replications. In each of these three situations with differing sample sizes, the first bandwidth is too small, revealing spurious structure, while the last one or two figures in each set use too large a bandwidth, obscuring the true nature of the distribution. Further, we see that the graphs shown in Figures 5.7, 5.13 and 5.18 reveal a logistic density with slightly heavier tails than that of a true normal density function.
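The kernel estimator with a normal kernel can be sketched directly from Silverman's definition. This is a minimal illustration, not the program used for the figures; the names `normal_kernel` and `kernel_density` are assumptions introduced here.

```python
import math


def normal_kernel(t):
    """Standard normal density, used as the kernel K."""
    return math.exp(-0.5 * t * t) / math.sqrt(2.0 * math.pi)


def kernel_density(x, data, h):
    """Kernel estimate (1 / (n h)) * sum K((x - X_i) / h) with bandwidth h."""
    if h <= 0:
        raise ValueError("bandwidth h must be positive")
    n = len(data)
    return sum(normal_kernel((x - xi) / h) for xi in data) / (n * h)
```

Evaluating `kernel_density` on a grid of x values for several choices of h reproduces the kind of comparison described above: a small h gives a spiky curve and a large h flattens it.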

Silverman discusses the asymptotic properties of the kernel estimator and "global accuracy" of $\hat{f}$ using the mean integrated square error (MISE) given by


Figure 5.4: Normal Density Smoothing, Bandwidth = 0.1, 10 Replications

Figure 5.5: Normal Density Smoothing, Bandwidth = 0.2, 10 Replications

Figure 5.6: Normal Density Smoothing, Bandwidth = 0.3, 10 Replications

Figure 5.7: Normal Density Smoothing, Bandwidth = 0.4, 10 Replications

Figure 5.15: Normal Density Smoothing, 25 Replications

Figure 5.19: Normal Density Smoothing, 50 Replications


$$\mathrm{MISE}(\hat{f}) = E \int \{\hat{f}(x) - f(x)\}^2 \, dx. \qquad (5.3.2)$$

This can be further decomposed into two elements as shown by Silverman: the integrated square bias and the integrated variance.

$$\mathrm{MISE}(\hat{f}) = \int \mathrm{MSE}_x(\hat{f}(x))\,dx = \int \{E\hat{f}(x) - f(x)\}^2\,dx + \int \mathrm{var}\,\hat{f}(x)\,dx \qquad (5.3.3)$$

Silverman then uses this to show that, for a given f, the bias term does not depend on the sample size directly, but depends only on the weight function, that is, the kernel. When the sample size is increased, the bandwidth will need to be adjusted accordingly in order to obtain the most appropriate smoothing function. This can easily be seen from the graphs shown in Figures 5.4 to 5.21 where the sample size ranges from 120 data points for Figures 5.4 to 5.9 to 600 data points for Figures 5.16 to 5.21.
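The effect of bandwidth on accuracy can be illustrated numerically. The sketch below computes the integrated squared error for a single simulated sample (the MISE would be its average over many samples) against the true standard logistic density, using a trapezoid rule. The grid limits, step count, seed, and function names are assumptions made for illustration.

```python
import math
import random


def logistic_density(x):
    """Standard logistic density, written symmetrically to avoid overflow."""
    e = math.exp(-abs(x))
    return e / (1.0 + e) ** 2


def kde(x, data, h):
    """Normal-kernel density estimate at x with bandwidth h."""
    c = 1.0 / (len(data) * h * math.sqrt(2.0 * math.pi))
    return c * sum(math.exp(-0.5 * ((x - xi) / h) ** 2) for xi in data)


def integrated_squared_error(data, h, lo=-10.0, hi=10.0, steps=400):
    """Trapezoid-rule approximation of the integral of (kde - f)^2."""
    width = (hi - lo) / steps
    total = 0.0
    for k in range(steps + 1):
        x = lo + k * width
        err = (kde(x, data, h) - logistic_density(x)) ** 2
        total += (0.5 if k in (0, steps) else 1.0) * err
    return total * width


def logistic_sample(n, seed=1):
    """Inverse-transform sample of n standard logistic values."""
    rng = random.Random(seed)
    out = []
    while len(out) < n:
        u = rng.random()
        if u > 0.0:
            out.append(math.log(u / (1.0 - u)))
    return out
```

Calling `integrated_squared_error(logistic_sample(120), h)` for a range of h values gives one numerical counterpart to the visual comparison of the figures.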


Since the normal density kernel estimator satisfies the required properties of boundedness and bounded variation as well as the usual probability density conditions, we can use arguments given by Silverman (attributed to Bertrand-Retali). This leads to the statement that with probability 1,

$$\sup_x |\hat{f}(x) - f(x)| \to 0 \text{ as } n \to \infty. \qquad (5.3.4)$$

Based on this, we conclude that the kernel estimator used in the current study has asymptotic uniform consistency.


CHAPTER  6 


APPLICATION  OF  THE  MODEL  TO  A  LIKERT  SCALE 

6.1  Introduction 

Recently, Gerhard Tutz (1986) discussed the use of an ordered response in the Bradley-Terry type of paired comparison experiment. This extends the traditional model of preference or no preference to one where the strength of preference is also characterized. His approach uses random utility functions and develops specific parameters within the framework of a weighted least-squares method. In this chapter we will use the minimum mean squared error estimate for $\phi^{-1}$ obtained in Chapter 2 to develop ad hoc estimates for the $\theta_i$'s used in the Likert Scale. A second approach to approximating the $\theta_i$'s will employ weighted least squares.

The Likert Scale was originally developed by Rensis Likert (1932) and his associates at the Division of Program Surveys of the US Department of Agriculture. They applied sampling methods to the study of attitudes in order to develop a technique for measuring individuals' feelings. Likert was the first to apply the method of internal consistency to attitude measurement and use a summated rating system to eliminate the need for a group of judges to sort statements (Young, 1966). The Likert technique uses a series of questions concerning a particular area or problem. Each question requires an answer from one of five words such as strongly approve, approve, undecided, disapprove, or strongly disapprove. These can be replaced with strongly agree, agree, no opinion, disagree and strongly disagree or by almost always, always, frequently, occasionally, rarely, and almost never, as necessary to fit the nature of the question. A score is allotted to each of the possible choices and the total score for each subject is obtained by simply summing the scores for each question.

The Likert Scale was compared to one developed by Thurstone (Edwards and Kenney, 1946) and was found to have higher reliability with fewer items to consider. The Likert technique is also easier to administer and less time-consuming. This scale is used frequently in social science and educational areas, as well as in industry to measure worker satisfaction, etc. Combining the use of paired comparisons on a continuous response scale with the Likert technique could provide insight into the theory behind this widely used scale.


6.2  Ad  Hoc  Procedure  for  Applying  the  Model  to 
the  Likert  Scale 

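In our original model $\phi$ mapped the real numbers onto [0,1], but in applying $\phi$ to the Likert scale technique it will map the reals onto integer values. For example, for the five $\theta$ values:

$$\phi(V_{ij}) = \begin{cases} -2 & \text{if } \theta_0 < V_{ij} < \theta_1 \\ -1 & \text{if } \theta_1 < V_{ij} < \theta_2 \\ \phantom{-}0 & \text{if } \theta_2 < V_{ij} < \theta_3 \\ \phantom{-}1 & \text{if } \theta_3 < V_{ij} < \theta_4 \\ \phantom{-}2 & \text{if } \theta_4 < V_{ij} < \theta_5 \end{cases} \qquad (6.2.1)$$

for $-\infty = \theta_0 < \theta_1 < \cdots < \theta_4 < \theta_5 = \infty$.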


Here $V_{ij}$ is the same logistic random variable as defined in Chapter 2 (see Definition 2). The $\theta_i$'s could be symmetrically placed for convenience, which results in $\theta_1 = -\theta_4$ and $\theta_2 = -\theta_3$. The two parameters, $\theta_0$ and $\theta_5$, could be replaced by $-\infty$ and $+\infty$, respectively. This gives us only two parameters to solve for, $\theta_1$ and $\theta_2$.

The original minimum mean squared error estimate for $\phi^{-1}$ (2.2.1) is modified to allow for the discontinuous case as follows:


)-(/'i-/<j)-Fv1ij( 


r  J 

E- 

S=2 


(p+l)/2 
n-  •  ) 


*-'(r-Zp)= 


SS  F^ij^r-(p+l)/2> 


k(k-l) 


where  qjj  =  p(<Ky)  <  r~Ep) 

s=l 


;  1  J 

S— (p+0/2 


n  ■ 


i  J 


(6.2.2) 


p is the total number of $\theta$'s in the model, and r is the value of the ith $\theta$ parameter.


By using the estimate for $\phi^{-1}(y) = v_{ij}$, we can map any $v_{ij}$ to its appropriate y value, similar to the example provided in 6.2.1. Simple estimates of the values of $\theta_1, \ldots, \theta_p$ are found as the midpoints between each of the $\phi^{-1}(y)$ values for $-\infty < y < \infty$.

Table 6.1 provides the values for $\phi^{-1}$ and its estimate as given in equation 6.2.2. The estimate is quite close to the value of the parameter for each of the combinations of numbers of treatments to be compared and number of replications. Using these five parameter values the midpoints are computed, thereby determining the $\theta_i$'s. This results in the $\theta_i$ values given in Table 6.1.

Since the $\theta$ parameters are to be symmetric with $\theta_1 = -\theta_4$ and $\theta_2 = -\theta_3$, we can average the estimates for $\theta_1$ and $-\theta_4$, and for $\theta_2$ and $-\theta_3$. In addition, we let $\theta_0 = -\infty$ and $\theta_5 = +\infty$. After averaging the values for each $\theta_i$ in all of the cases displayed in Table 6.1, the values are presented in Table 6.2 (second column).
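The midpoint and averaging steps described above are simple enough to sketch. The function names `midpoints` and `symmetrize` are hypothetical, and the example input is the set of five true $\phi^{-1}$ values rather than any particular row of Table 6.1.

```python
def midpoints(values):
    """Midpoints between consecutive (sorted) phi-inverse values."""
    ordered = sorted(values)
    return [(a + b) / 2 for a, b in zip(ordered, ordered[1:])]


def symmetrize(thetas):
    """Average theta_k with -theta_(p+1-k) so the cut-offs are reflective."""
    p = len(thetas)
    return [(thetas[k] - thetas[p - 1 - k]) / 2 for k in range(p)]


# With the exact values -2, ..., 2 the cut-offs fall at the half-integers.
cutoffs = midpoints([-2, -1, 0, 1, 2])
```

After `symmetrize`, the first and last cut-offs are exact negatives of each other, as are the two middle ones, which matches the symmetry imposed on $\theta_1, \ldots, \theta_4$.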


Table 6.1

φ⁻¹ and θ Estimates for Five Parameters

Number of    Number of                  Estimate
Treatments   Replications     φ⁻¹      of φ⁻¹       i     θ_i
    3            10           -2       -2.0515
                              -1       -1.0885
                                                    1    -1.5700
                               0       -0.2310
                                                    2    -0.6598
                               1        1.0885
                                                    3     0.6285
                               2        1.8265
                                                    4     1.4575
    4            10           -2       -2.0515
                              -1       -1.0517
                                                    1    -1.5516
                               0       -0.0817
                                                    2    -0.5065
                               1        1.1334
                                                    3     0.5259
                               2        2.0688
                                                    4     1.150
    4            10           -2         --
                              -1       -0.986
                                                    1    -1.5728
                               0       -0.027
                                                    2    -0.5065
                               1        1.003
                                                    3     0.488
                               2        2.0194
                                                    4     1.150
    5            10           -2       -2.0660
                              -1       -1.0796
                                                    1    -1.5728
                               0       -0.0778
                                                    2    -0.5787
                               1        1.0962
                                                    3     0.5092
                               2        2.0194
                                                    4     1.5578
    5            25           -2       -2.0639
                              -1       -1.0215
                                                    1    -1.5427
                               0       -0.0085
                                                    2    -0.5150
                               1        1.0013
                                                    3     0.4960
                               2        2.0998
                                                    4     1.5456

Table 6.2

Average θ_i Values

          Averaged Over    Averaged Over Cases
θ_i       All 5 Cases      with 25 Replications    Practical θ_i
θ_1          -1.55               -1.52                 -1.5
θ_2          -0.57               -0.51                 -0.5
θ_3           0.49                0.49                  0.5
θ_4           1.54                1.53                  1.5

If, however, we average only the cases with 25 replications, we can see the $\theta_i$ parameters drawing closer to a truly reflective situation, as expected. For practical purposes we could determine the $\theta_i$'s to be those given in the final column of Table 6.2. These then would be the cut-off points to determine the appropriate y value for a given region. For example, if the comparison resulted in $V_{ij} = -0.35$, it would be mapped to 0 on the Likert Scale, revealing no difference between the two items compared. If $V_{ij} = 3.5$, then it would be mapped to 2, indicating a strong preference for the first item in the comparison. Some scaling adjustments may need to be made since the paired comparison estimates are only accurate to a scale parameter. The example used in this discussion centers the scale around 0, for convenience. As shown in discussions in previous chapters, the values of $\delta_{ij} = \mu_i - \mu_j$ can change the relative position of the $V_{ij}$'s on the real line.

6.3 Weighted Least Squares Approach to Determining $\theta_i$'s

A more theoretical approach to determining the $\theta_i$'s involves the use of weighted least squares as described by Beaver (1977) for univariate models involving paired comparisons. In the current research we have a transformation that is linear in the log scale, which fits the model given in Beaver's article. Here the sample proportions are given by


Pi  = 


-(#*;-#*  i+*0 

1+e  J 


P2 


+02)  -Ui-/*i+0i) 

1+e  J  1+e  J 




p3 


1+e  J  1+e  J 


P  4  = 


-Or,i.+04)  _(M  ^  +^3) 

1+e  J  1+e  J 


Ps  =  1  ~ 


1+e  J 


(6.3.1) 


By considering the following linear combinations of these proportions, the logit transformation can be used to produce a model linear in $\mu_i$, $\mu_j$ and $\theta_1, \ldots, \theta_4$:


Pi 


1+e 


1 _ 


P1+P2 


_ 1 _ 

1+e  J 


Pl  +  P2  +  P3  = 


_ 1 _ 

-(Pi-A*i+®3> 
1+e  J 


P1  +  P2  +  P3  +  P4 


_ 1 _ 

1+e  J 


(6.3.2) . 


Since the sum of the proportions must be 1, $\sum_{i=1}^{5} p_i = 1$; hence the linear combinations in 6.3.2 represent 4 independent linear functions of the 5 multinomial probabilities. When the logit transformation $\ln\frac{p}{1-p}$ is applied to $p_1$, we have

Pi  _ 


=  In 


Oj-Z'j+tfi) 

1 

-  Oi-Pj+^i) 


=  In 


=  in 


so  that 


(6.3.3) 


Similarly, using logits, $\theta_2$, $\theta_3$ and $\theta_4$ can be expressed in a loglinear form as follows:


ln  ( i-(£?P,))  = 

ln  (  rrffffk))  = 


In  f  P1  +  P2  +  P3  +  P4  \  U.-U.  +  0 

\l-(Pl  +  P2  +  P3  +  P4>J  1  * 


(6.3.4) 


Following the approach in Section 6.2, we consider the $\theta_i$'s to be reflective with $\theta_1 = -\theta_4$ and $\theta_2 = -\theta_3$.

We again consider the pairs (i,j), $1 \le i < j \le t$, and (j,i), $1 \le j < i \le t$, with t(t-1) pairs in all. The basic model given by Beaver for the loglinear situation is


$$F(\pi) = K \ln(A\pi) \qquad (6.3.5)$$


where  &\.=  (Vij’  Vij’  •••  Vij)  and 

s!  =  (in.  •  •  •  4,t-l>  ; 

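$K = I \otimes K_0$ and $A = I \otimes A_0$; and $\otimes$ denotes the right direct matrix product. For an arbitrary pair (i,j), with i<j, we have

$$A_0 \pi_{ij} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 1 & 1 & 1 \\ 1 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 \\ 1 & 1 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 1 \\ 1 & 1 & 1 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} p_1 \\ p_2 \\ p_3 \\ p_4 \\ p_5 \end{bmatrix} \qquad (6.3.6)$$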
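The role of $A_0$, together with the contrast matrix $K_0$ of equation 6.3.7, can be checked numerically: applying $K_0 \ln(A_0 p)$ to five multinomial proportions returns the four cumulative logits. In the sketch below the proportions are built from a logistic cdf evaluated at four increasing linear predictors `eta`; that construction, the example values, and the function names are assumptions for illustration, and no sign convention for the location difference is implied.

```python
import math

# A0 forms each cumulative sum and its complement; K0 takes log differences.
A0 = [
    [1, 0, 0, 0, 0], [0, 1, 1, 1, 1],
    [1, 1, 0, 0, 0], [0, 0, 1, 1, 1],
    [1, 1, 1, 0, 0], [0, 0, 0, 1, 1],
    [1, 1, 1, 1, 0], [0, 0, 0, 0, 1],
]
K0 = [
    [1, -1, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, -1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, -1, 0, 0],
    [0, 0, 0, 0, 0, 0, 1, -1],
]


def matvec(matrix, vector):
    """Plain matrix-vector product on nested lists."""
    return [sum(a * b for a, b in zip(row, vector)) for row in matrix]


def cumulative_logits(p):
    """Compute K0 ln(A0 p) for five proportions p summing to 1."""
    logs = [math.log(s) for s in matvec(A0, p)]
    return matvec(K0, logs)


def proportions_from_predictors(eta):
    """Five cell probabilities from a logistic cdf at four increasing values."""
    cdf = [1.0 / (1.0 + math.exp(-e)) for e in eta]
    bounds = [0.0] + cdf + [1.0]
    return [b - a for a, b in zip(bounds, bounds[1:])]


eta = [-1.5, -0.5, 0.5, 1.5]  # hypothetical linear predictors
recovered = cumulative_logits(proportions_from_predictors(eta))
```

Here `recovered` agrees with `eta` up to rounding, which is the sense in which the transformed proportions are linear in the parameters.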


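The above then results in

$$F_{ij}(\pi) = K_0 \ln(A_0 \pi_{ij}) = \begin{bmatrix} \ln p_1 - \ln(p_2+p_3+p_4+p_5) \\ \ln(p_1+p_2) - \ln(p_3+p_4+p_5) \\ \ln(p_1+p_2+p_3) - \ln(p_4+p_5) \\ \ln(p_1+p_2+p_3+p_4) - \ln(p_5) \end{bmatrix}$$

where

$$K_0 = \begin{bmatrix} 1 & -1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & -1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & -1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & -1 \end{bmatrix} \qquad (6.3.7)$$

This gives, for more specific pairs, the following:

$$F(\pi_{12}) = \begin{bmatrix} 1 & -1 & 0 & \cdots & 0 & 1 & 0 & 0 & 0 \\ 1 & -1 & 0 & \cdots & 0 & 0 & 1 & 0 & 0 \\ 1 & -1 & 0 & \cdots & 0 & 0 & 0 & 1 & 0 \\ 1 & -1 & 0 & \cdots & 0 & 0 & 0 & 0 & 1 \end{bmatrix} \beta$$

$$F(\pi_{13}) = \begin{bmatrix} 1 & 0 & -1 & 0 & \cdots & 0 & 1 & 0 & 0 & 0 \\ 1 & 0 & -1 & 0 & \cdots & 0 & 0 & 1 & 0 & 0 \\ 1 & 0 & -1 & 0 & \cdots & 0 & 0 & 0 & 1 & 0 \\ 1 & 0 & -1 & 0 & \cdots & 0 & 0 & 0 & 0 & 1 \end{bmatrix} \beta$$

and similarly for all of the (i,j) pairs with i<j, up to (t,t-1) as given below: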

Combining 6.3.5 through 6.3.7 provides a design matrix in the form $F(\pi) = X\beta$ where


al 

a2 

a3 


h 

e2 

^3 


and  t  is 

the  number  of 
treatments  being 
compared . 


The  design  matrix  takes  the  form 


X 


1-10 

1-10 

1-10 

1-10 

10-1 

10-1 


0  1 
0  0 
0  0 
0  0 
0  1 
0  0 


0  0 
1  0 
0  1 
0  0 
0  0 
1  0 


0  0 
0  0 
0  0 
1  0 
0  0 
0  0 


0  0...  1  -1  00  1  00 
0  0...  1  -1  000  1  0 


Hence  we  now  have  a  model  linear  in  the  parameters 
Qi  j  ’  '  -0t-l»  and  the  weighted  least  squares 


technique as applied by Beaver is applicable. Without any loss of generality, we let $\mu_i = \mu_j$, $1 \le i < j \le k$, since these parameters are location parameters. This simplification makes the computation slightly easier, although we could also solve for the location parameters, $\mu_i$, if these parameters were of interest at this point in the research. In terms of the Likert scale, however, the $\theta_i$'s are of major concern and we concentrate on this aspect of the model for the remainder of this section.

Using the data generated as explained in Section 5.1, we can determine the estimates for $\theta_1$, $\theta_2$, $\theta_3$, and $\theta_4$, using a weighted least squares analysis. Each comparison of (i,j) and (j,i) generates estimates for each of the $\theta_i$'s from the replications at each comparison. These are then averaged over all of the pairs to obtain an estimate for $\theta_1$, $\theta_2$, $\theta_3$, and $\theta_4$. Table 6.3 shows the results of this simulation. The parameters are quite close to the original Likert Scale values with reflectivity of the parameters, that is, $\theta_1 = -\theta_4$ and $\theta_3 = -\theta_2$. Of course, with smaller sample sizes the estimates are not quite as accurate and reveal less reflectivity.

Any $V_{ij}$ in the interval $[\theta_2, \theta_3]$ would be mapped to 0 in the y scale, implying no true difference exists between the items being compared. A $V_{ij}$ in the interval $[\theta_3, \theta_4]$ would indicate a preference for the first of the items compared, while a $V_{ij}$ in $[\theta_1, \theta_2]$ reveals a preference for the second item. Strong preferences are shown for the first or second item by obtaining a value of $V_{ij}$ greater than $\theta_4$ or less than $\theta_1$, respectively.

These  findings  support  the  interpretation 
of  the  Likert  Scale  and  provide  a  more  theoretical 
basis  for  the  widespread  conclusions  frequently 
made  from  studies  using  this  technique.  Although 
simulations  have  revealed  that  the  parameter 
estimates  are  valid  for  moderate  sample  sizes,  it 
would  be  important  to  actually  use  these  estimates 
in  a  practical  study. 


Table 6.3

Treatments   Replications    θ       Value
    4            10          θ_1    -1.5607
                             θ_2    -1.0517
                             θ_3     1.1334
                             θ_4     1.3327
    3            25          θ_1    -2.0310
                             θ_2    -0.9892
                             θ_3     1.0239
                             θ_4     1.9753
    4            25          θ_1    -2.0214
                             θ_2    -0.9857
                             θ_3     1.0030
                             θ_4     2.0270
    5            25          θ_1    -2.0639
                             θ_2    -1.0215
                             θ_3     1.0013
                             θ_4     2.0898


BIBLIOGRAPHY

1. Beaver, R.J. (1974). "Locally asymptotically most stringent tests for paired comparison experiments." J. Amer. Statist. Assn. 69:423-27.

2. Beaver, R.J. and Gokhale, D.V. (1975). "A model to incorporate within pair order effects in paired comparisons." Communications in Statistics. 4:923-39.

3. Beaver, R.J. and Rao, P.V. (1972). "The use of limit theorems in paired and triple comparison model building." Journal of Mathematical Psychology. 9:92-103.

4. Beaver, Robert J. (1977). "Weighted least squares analysis of several univariate models." J. Amer. Statist. Assn. 72:629-34.

5. Boos, Dennis D. (1979). "A differential for L-statistics." Annals of Statistics. 7:955-9.

6. Bradley, R.A. and Terry, M.E. (1952). "The rank analysis of incomplete block designs I. The method of paired comparisons." Biometrika. 39:323-45.

7. Bradley, R.A. (1976). "Science, statistics and paired comparisons." Biometrics. 32:213-32.

8. Charnet, R. and Beaver, R.J. (1988). "Design and analysis of paired comparison experiments involving mixtures." J. Statist. Plann. Inf. 20:91-103.

9. Chernoff, Herman, Gastwirth, Joseph L., and Johns, M.V. (1967). "Asymptotic distribution of linear combinations of functions of order statistics with applications to estimation." Annals of Mathematical Statistics. 38:52-72.

10. David, H.A. (1969). The Method of Paired Comparisons. Second Edition, Griffin, London.

11. David, H.A. (1981). Order Statistics. Wiley Series in Probability and Statistics.

12. Davidson, R.R. (1970). "On extending the Bradley-Terry model to accommodate ties in paired comparison experiments." J. Amer. Statist. Assn. 65:317-28.

13. Edwards, Allen L. and Kenney, Kathryn C. (1946). "A comparison of the Thurstone and Likert scale techniques of attitude scale construction." Journal of Applied Psychology. 30:72-83.

14. El-Helbawy, A.T. and Ahmed, E.A. (1984). "Optimal design results for 2 factorial paired comparison experiments." Communications in Statistics. 13(22):2827-2845.

15. Ford, L.R. Jr. (1957). "Solution of a ranking problem from binary comparisons." American Math. Monthly. 64(8):28-33.

16. Hogg, R.V. and Craig, A.T. (1978). Introduction to Mathematical Statistics. Fourth Edition. Macmillan Publishers.

17. Likert, Rensis. "A technique for the measurement of attitudes." Archives of Psychology, Columbia University Press. No. 140, 1932.

18. Luce, R.D. Individual Choice Behavior: A Theoretical Analysis. New York: Wiley, 1959.

19. Mosteller, F. (1951). "Remarks on the methods of paired comparisons: I. The least squares solution assuming equal standard deviations and equal correlations." Psychometrika. 16:3-9.

20. Pendergrass, R.N. and Bradley, R.A. (1960). "Ranking in Triple Comparisons." Contributions to Probability and Statistics. Stanford University Press: 331-51.

21. Rao, P.V. and Kupper, L.L. (1967). "Ties in paired-comparison experiments: A generalization of the Bradley-Terry Model." J. Amer. Statist. Assn. 62:194-204.

22. Shorack, Galen R. (1972). "Convergence of quantile and spacing processes with applications." Annals of Math. Stat. 43:1400-11.

23. Silverman, B.W. Density Estimation for Statistics and Data Analysis. London and New York: Chapman and Hall, 1986.

24. Sirotnik, B.W. and Beaver, R.J. (1984). "Paired comparison experiments involving all possible pairs." British Journal of Math. and Statist. Psychology. 37:22-33.

25. Stigler, S.M. (1974). "Linear functions of order statistics with smooth weight functions." Annals of Statistics. 2:676-693.

26. Thurstone, L.L. (1927). "A law of comparative judgement." Psychological Review. 34:273-86.

27. Tutz, G.A. (1986). "Bradley-Terry-Luce models with an ordered response." Journal of Mathematical Psychology. 30:306-16.

28. Young, Pauline V. Scientific Social Surveys and Research. New Jersey: Prentice-Hall, 1966.


